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Preface 


Legend 


Manual Overview 


The VR5432™ microprocessor is an NEC VR Series™ RISC 
(reduced instruction set computer) microprocessor that implements 
the high-performance 64-bit MIPS® IV architecture. This manual 
describes the architecture and hardware functions of the VR5432 
microprocessor. 


Data significance: Higher on left and lower on right 
Active-high signal name: XXX 
Active-low signal name: XXX* 
Numeric representation: — binary ... XXXX or XXXX, 

decimal ... XXXX 

hexadecimal ... OXXXXX 
Prefixes representing an exponent of 2 (for address space or memory 
capacity): 


K (kilo) 2! = 1024 

M (mega) Digtael (SS 
G (giga) 239 = 10243 
T (tera) 270 = 1024" 


The manual is divided into two volumes. Volume | is the user manual, 
containing processor architectural and functional information and 
instructions. Volume 2 contains the instruction set information and 
appendixes. 


Volume I (U13751E) 


Chapter 1: Introduction provides an overview of the device 
features, CPU, Floating-Point Unit (FPU), and pipeline. 


Chapter 2: Signal Descriptions discusses the pin configuration and 
functions of the Vr5432 processor signals. 


Chapter 3: Pipeline describes the dual-issue instruction pipeline 
stages, delays, and interlock and exception handling. 
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Chapter 4: Memory Management Unit discusses the processor’s 
virtual and physical address spaces, the virtual-to-physical address 
translation, the translation lookaside buffer (TLB) process, and the 
system control coprocessor registers that provide the software 
interface to the TLB. 


Chapter 5: Cache Organization and Operation describes the cache 
memory’s place in the VR5432 memory configuration and individual 
cache organization. 


Chapter 6: CPU Exceptions describes the processor’s exception 
types, registers, vector offsets, processing handling, and interrupts. 


Chapter 7: Floating-Point Unit describes the FPU coprocessor, 
including the programming model, instruction set and formats, and 
the pipeline. 


Chapter 8: Floating-Point Exceptions discusses FPU exception 
types, exception trap processing, exception flags, saving and restoring 
states when handling an exception, and trap handlers for IEEE 
Standard 754 exceptions. 


Chapter 9: Bus Interface describes how the processor accesses the 
external resources needed to satisfy cache misses and uncached 
operations, while permitting an external agent access to some of the 
processor’s internal resources. 


Chapter 10: System Interface Transactions (Native Mode) 
describes processor and external requests in the native system 
interface protocol of the VR5432 processor. 


Chapter 11: System Interface Protocols (Native Mode) contains a 
cycle-by-cycle description of the system interface protocols for each 
type of processor and external request in the native protocol of the 
VR5432 processor. 


Chapter 12: System Interface Transactions (R43K Mode) This 
section describes processor and external requests as they occur in 
R43K (VR4300 compatibility) mode. 


Chapter 13: System Interface Protocols (R43K Mode) contains a 
cycle-by-cycle description of the system interface protocols for each 
type of processor and external request in R43K mode. 


Chapter 14: Initialization Interface describes the processor reset 
and initialization signals. 
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Chapter 15: Clock Interface describes the basic system clocks, 
SysClock and PClock, and Phase-Locked Loop (PLL) and Bypass 
PLL modes. 


Volume 2 (UI5397E) 


Chapter 16: Instruction Set Overview discusses the general 
attributes of the CPU, FPU, multimedia, and debugging instructions 
of the MIPS IV instruction set architecture (ISA) utilized by the 
VR5432 processor. 


Chapter 17: CPU Instruction Set describes the details of the CPU 
instructions. 


Chapter 18: Floating-Point Unit Instruction Set describes the 
details of the FPU instructions. 


Chapter 19: Multimedia Instruction Set describes the details of the 
multimedia instructions. 


Chapter 20: Debug and Test Features describes the VR5432 
processor’s debug and test functions, Debug mode, and debug 
instructions. 


Appendix A: Sublock Order describes how a block of data elements 
(bytes, halfwords, words, or doublewords) can be retrieved from 
storage in sequential or nonsequential (sub-block) order. 


Appendix B: Comparing the Vr4300, Vr5000, and Vr5432 
Processors delineates each processor’s attributes. 


Appendix C: PLL Analog Power Filtering illustrates the phase- 
locked loop circuit configuration. 


Appendix D: Instruction Hazards identifies the VR5432 instruction 
hazards that occur with certain instruction and event combinations 
(such as pipeline delays, cache misses, interrupts, and exceptions). 
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Related Documents 


See also the following documents. The related documents indicated 
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versions are not marked as such. 
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This chapter provides an overview of the instruction set architecture (ISA) utilized 
by the VR5432 processor. For detailed information on each instruction type, refer 
to the following chapters. 


¢ Chapter 17, CPU Instruction Set, on page 3 

¢ Chapter 18, Floating-Point Unit Instruction Set, on page 569 
¢ Chapter 19, Multimedia Instruction Set, on page 677 

¢ Chapter 20, Debug and Test Features, on page 737 
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16.1 Instruction Set Architecture 
The VR5432 processor executes the MIPS IV instruction set (a superset of the 
MIPS III instruction set) plus instructions added by NEC specifically for VR5432 
implementation. As Figure 16-1 illustrates, each new architecture level (or 


version) includes the former levels. Therefore, a processor implementing MIPS 
IV can also run MIPS I, MIPS II, or MIPS III binary programs without change. 


MIPS II 


Figure 16-1 MIPS Architecture Extensions 


The MIPS IV instruction set for the Vr5432 processor utilizes the following 
instruction types. 


e CPU instructions 
¢ Floating-point instructions 
¢ Multimedia instruction 


e Test and debug instructions 


In earlier MIPS architectures, coprocessor instructions were implementation 
dependent. In the MIPS IV architecture, the Coprocessor 3 instruction formats 
have been used for extensions to the floating-point instruction set. In the VR5432 
implementation, the Coprocessor 2 instruction formats have been used for 
implementation-specific instruction set extensions. The new MIPS IV, VRr5432 
processor-specific instructions are summarized and briefly explained in Section 
16.8. 
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16.2 Instruction Formats 


Each instruction consists of a single 32-bit word aligned on a word boundary. 
There are three instruction formats—immediate (I-type), jump (J-type), and 
Register (R-type). The use of a small number of instruction formats simplifies 
instruction decoding, allowing the compiler to synthesize more complicated (and 
less frequently used) operations and addressing modes from these three formats. 
See the subsequent instruction chapters for details on the formats of each 
instruction type. 


16.3 Load and Store Instructions 


Load and Store instructions are immediate (I-type) instructions that transfer data 
between the memory system and the general-purpose register sets in the CPU and 
coprocessors. There are separate instructions for different purposes: transferring 
variously sized fields, treating loaded data as signed or unsigned integers, 
accessing unaligned fields, selecting the addressing mode, and providing atomic 
memory updates (read-modify-write cycles). 


Regardless of byte ordering (big- or little-endian), the address of a halfword, 
word, or doubleword is the smallest byte address among the bytes forming the 
object. For big-endian ordering, this is the most-significant byte; for little-endian 
ordering, this is the least-significant byte. 


Except for the few specialized instructions listed in Table 17-2, Load and Store 
instructions must access naturally aligned objects. An attempt to load or store an 
object at an address that is not an even multiple of the size of the object will cause 
an Address Error exception. 


Load and Store operations have been added in each revision of the architecture: 
MIPS II 
¢ 64-bit coprocessor transfers 
e Atomic update 
MIPS III 
¢ 64-bit CPU transfer 
¢ Unsigned word load for the CPU 
MIPS IV: Register + r egister addressing mode for the FPU 


Table 16-1 and Table 16-2 tabulate the supported Load and Store operations and 
indicate the MIPS architecture level at which each operation was first supported. 
The instructions themselves are listed in the following sections. 
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Table 16-1 Load/Store Operations Using Register + Offset Addressing Mode 


Coprocessor 
oy (except 0) 
. Load Load 
Data Size Signed | Unsigned Store Load Store 
Byte I I I 
Halfword I I I 
Word I Il I I I 
Doubleword Il Ill II II 
Unaligned word I I 
Unaligned doubleword | III Il 
Linked word 
(atomic modify) - T 
Linked doubleword Ww W 
(atomic modify) 


Table 16-2 Load/Store Operations Using Register + Register Addressing Mode 


Data Size 


Load 


Floating-Point Coprocessor Only 


Store 


Word 


IV 


IV 


Doubleword 
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16.3.1 Delayed Load Instructions 


The MIPS I architecture defines delayed loads; an instruction scheduling 
restriction requires that an instruction immediately following a load into register 
Rn cannot use Rn as a source register. The time between the Load instruction and 
the time the data is available is the “load delay slot.” If no useful instruction can 
be put into the load delay slot, then a null operation (assembler mnemonic NOP) 
must be inserted. 


In MIPS I, this instruction scheduling restriction is removed. Programs will 
execute correctly when the loaded data is used by the instruction following the 
load, but this may require extra read cycles. Most processors cannot actually load 
data quickly enough for immediate use and the processor will be forced to wait 
until the data is available. Scheduling load delay slots can be desirable, both for 
performance and compatibility with earlier Vr Series processors. However, the 
scheduling of load delay slots is not required for correct operation of the 
processor. 


16.3.2 Defining Access Types 


Access type indicates the size of a VR5432 processor data item to be loaded or 
stored, as set by the Load or Store instruction opcode. 


Regardless of access type or byte ordering (endianness), the address given 
specifies the low-order byte in the addressed field. For a big-endian configuration, 
the low-order byte is the most-significant byte; for a little-endian configuration, 
the low-order byte is the least-significant byte. 


The access type, together with the three low-order bits of the address, define the 
bytes accessed within the addressed doubleword (shown in Table 16-3). Only the 
combinations shown in Table 16-3 are permissible; other combinations cause 
Address Error exceptions. 
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Table 16-3 Byte Access within a Doublew or 


324 


Areesa ae Bytes Accessed 
Mnemonic G3eene8 sir es Bama 0) Omer aie sued 0) 
ae baal lia lee Byte Byte 
Doubleword (7) | 0 | 0 | 0 1]2]3]4)516|7/7/6]5|4]3]21/1 
ae 0/0] o 1[2]/3]4]5 6 i « Aleoraene 
o/o]1 1/2}3]4]/slel7/7/6]s5/4]3]2l1 
0]/o0]/ofol1{2/3]4]s5 eal 5]/4/3]2]1]0 
Pe ool 0 M2 3 [4lslol7i7zfolsl4)3[2 
eae O | oO | olola Pelsia 43.) tao 
of1]1 3/4] 5lel7]/7]e]5]4]/3 
o/o]ofol1{2]3 3/2/1110 
Word (3) 
1] 0] 0 4/s5l6|7/7/6]5]4 
0]/o]ofol1 1] 0 
Triplebyte (2) all : : : : L 
1] 0] 0 4] 5 6/514 
Mmliccoals: s|6|7/7]6]5 
0]/o0]ofol1 1] 0 
Halfword (1) Pall meer ai 
1] 0] 0 4] 5 5] 4 
ial cee 6|7/7|6 
0]0]0/0 0 
o/o]1 1 1 
0/1] 0 2 2 
sees o/1]41 3 3 
1] 0] 0 4 4 
1Jol1 5 5 
Talia: Io 6 6 
ae ale 7147 
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Table 16-4 Access Type Specifications for Load/Store Instruction 


Access Type SysCmd (2:0) Meaning 
Doubleword 7 8 bytes (64 bits) 
Septibyte 6 7 bytes (56 bits) 
Sextibyte 5 6 bytes (48 bits) 
Quintibyte 4 5 bytes (40 bits) 
Word 3 4 bytes (32 bits) 
Triplebyte 2 3 bytes (24 bits) 
Halfword 1 2 bytes (16 bits) 
Byte 0 1 byte (8 bits) 
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16.4 Computational Instructions 


Computational instructions can be in either register (R-type) format, in which both 
operands are registers, or immediate (I-type) format, in which one operand is a 16- 
bit immediate. 


Two’s-complement arithmetic is performed on integers represented in two’ s- 
complement notation. There are signed versions of add, subtract, multiply, and 
divide operations. There are add and subtract operations, called “unsigned,” that 
are actually modulo arithmetic without overflow detection. There are unsigned 
versions of multiply and divide. There is a full complement of shift and logical 
operations. 


MIPS I provides 32-bit integers and 32-bit arithmetic. MI PSIII adds 64-bit 
integers and provides separate Arithmetic and Shift instructions for 64-bit 
operands. Logical operations are not sensitive to the width of the register. 


Computational instructions perform the following operations on register values: 


e = Arithmeti 


¢« Logical 
¢« Rotate 
¢ Shift 

e Multipl 
e« Divide 


e  Multiply-accumulat 
e Parallel operations on packed bytes 
These operations fit in the following six categories of computational instructions: 
¢ ALU immediate instruction 
e Three-operand register-type instructions 
¢ Rotate and Shift instructions 
¢ Multiply and Divide instructions 
¢  Multiply-accumulate instructions 


e Packed byte instructions 
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16.4.1 64-Bit Operations 


The VR5432 microprocessor has a 64-bit architecture that supports 32-bit 
operands. These operands must be sign extended. Opcodes are available for 32-bit 
operands for all of the basic arithmetic and logical instructions, such as: ADD, 
ADDU, SUB, SUBU, ADDI, SLL, SRA, and SLLV. Operations that don’t use 
sign-extended 32-bit values correctly are unpredictable. In addition, 32-bit data is 
stored sign extended in a 64-bit register. 
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16.5 


16.5.1 


16.5.2 
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Jump and Branch Instructions 


All Jump and Branch instructions have a delay slot of exactly one instruction. That 
is, the instruction immediately following a Jump or Branch instruction (the 
instruction occupying the delay slot) is executed while the target instruction is 
being fetched from the cache. A Jump or Branch instruction cannot be used in a 
delay slot; however, if they are used, the error is not detected and the results of 
such an operation are undefined. 


If an exception or interrupt prevents the completion of the instruction while it is 
in a delay slot, the hardware sets a virtual address to the EPC register at the point 
of the Jump or Branch instruction that precedes it. When exception or interrupt 
processing is complete and the program is restored, both the Jump and Branch 
instruction and the instruction in the delay slot are re-executed. 


Because Jump and Branch instructions may be re-executed after exception or 
interrupt processing, register 31 (the register in which the link address is stored) 
should not be used as a source register in Jump, Link/Branch, and Link 
instructions. 


Because instructions must be word-aligned, a Jump Register or Jump and Link 
Register instruction must use a register that contains an address where the low- 
order two bits are zero. If these low-order two bits are not zero, an Address Error 
Exception instruction at the Jump destination is fetched. 


Jump Instructions 


Subroutine calls in high-level languages are usually implemented with Jump or 
Jump and Link instructions, both of which are J-type instructions. In J-type 
format, the 26-bit target address is shifted left 2 bits and concatenated with the 
high-order 4 bits of the current program counter to form an absolute address. 


Returns, dispatches, and large cross-page jumps are usually implemented with the 
Jump Register or Jump and Link Register instructions. Both are R-type 
instructions that take the 64-bit byte address contained in one of the general- 
purpose registers. 


Branch Instructions 


All Branch instruction target addresses are calculated by adding the address of the 
instruction in the delay slot to the 16-bit offset (shifted left by 2 bits and sign- 
extended to 64 bits). All branches occur with a delay of one instruction. 
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If a conditional Branch Likely instruction is not taken, the instruction in the delay 
slot is nullified (i.e., discarded without affecting any data). 


16.6 Special Instructions 


Special instructions allow the software to initiate traps both conditionally and 
unconditionally. These instructions can cause System Call (SysCall), Breakpoint 
(Break), and Trap (Trap) conditions in the processor. SysCall and Break are 
unconditional, while Trap can specify a condition such as a Branch instruction. 
The Synchronize (Sync) instruction allows the software to ensure that all pending 
operations are complete. In the VrR5432 processor implementation, the Sync 
instruction is executed as an NOP. 


16.7 Coprocessor Instructions 


Coprocessors are alternate execution units with register files separate from the 
CPU. The MIPS architecture provides an abstraction for up to 4 coprocessor units, 
numbered 0 to 3. Each architecture level defines some of these coprocessors, as 
shown in Table 16-5. Coprocessor 0 is always used for system control and 
Coprocessor | is used for the floating-point unit. Other coprocessors are 
architecturally valid, but do not have a reserved use. Some coprocessors are not 
defined and their opcodes are either reserved or used for other purposes. 


Table 16-5 Coprocessor Definition and Use in the MIPS Architect ur 


MIPS Architecture Level 
Coprocessor I I Wi IV 
0 Sys. control |Sys. control | Sys. control | Sys. control 
FPU FP FP FPU 
2 Unused Unused Unused Unused 
3 Unused Unused Not defined on 1X) 


The coprocessors may have two register sets, Coprocessor general-purpose 
registers and coprocessor control registers, with each set containing up to 32 
registers. Coprocessor computational instructions may alter registers in either 
set. 
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System control for all MIPS processors is implemented as Coproce ssor0 (CPO), 
the system control coprocessor. It provides the processor control, memory 
management, and exception handling functions. The CPO instructions are specific 
to each CPU and are documented with the CPU-specific information. 


If a system includes a floating-point unit, it is implemented as coproces sor] 
(CP1). In MIPS I V, the FPU also uses the computation opcode space for 
Coprocessor unit 3, renamed COP1X. The FPU instructions are documented in 
Chapter 18. 
The coprocessor instructions are divided into two main groups: 
¢ Load and Store instructions that are reserved in the main opcode 
space 


* Coprocessor-specific operations that are defined entirely by the 
coprocessor 


16.7.1 Coprocessor Load and Store 


Load and Store instructions are not defined for CPO; the Move to/from 
Coprocessor instructions provide the only way to write and read the CPO registers. 


16.7.2 Coprocessor Operations 


Up to four coprocessors and their instructions are shown generically for 
coprocessor z. Within the operation main opcode, the coprocessor has further 
coprocessor-specific instructions encoded. 


Table 16-6 Coprocessor Operation Instruction 


Defined in 
MIPS... 


Mnemonic Description 


COPz Coprocessor z Operation I 
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16.8 Implementation-Specific Instructions 


16.8.1 Overview 


The MIPS IV instructions added by NEC for the Vr5432 processor enable the 
MIPS architecture to compete in the high-end numeric processing market, which 
has traditionally been dominated by vector architectures. 


Compound Multiply-Add instructions are included, taking advantage of the fact 
that most floating-point computations use the chained multiply-add paradigm. 
The intermediate multiply result is rounded before the addition is performed. 


A tegister + register addressing indexed mode for floating-point loads and stores 
eliminates the extra integer required in many array accesses. However, issuing a 
register + register load causes a one-cycle stall in the pipeline, which makes it 
useful only for compatibility with other MIPS IV implementations. Register + 
register indexed addressing for integer memory operations is not supported. 


A set of four conditional move operators allows floating-point arithmetic IF 
statements to be represented without branches. THEN and ELSE clauses are 
computed unconditionally and the results are placed in a temporary register. 
Conditional move operators then transfer the temporary results to their true 
register. Conditional moves must be able to test both integer and floating-point 
conditions in order to supply the full range of IF statements. Integer tests are 
performed by comparing a general-purpose register against a zero value. 


Because floating-point conditional moves test the floating-point condition codes, 
the VR5432 processor provides eight condition codes to give the compiler 
increased flexibility in scheduling the comparison and the conditional moves. 


Table 16-7 lists the new instructions that complete the MIPS IV instruction set; 
these instructions are described in Section 16.8.2 on page 333. 
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Table 16-7 MIPS IV Instruction Additions 


Instruction Definition 
BCIF Branch on FP condition code false 
BCIT Branch on FP condition code true 
BCIFL Branch on FP condition code false likely 
BCITL Branch on FP condition code true likely 
C.cond.fmt (cc) Floating-point compare 
LDXCl Load doubleword indexed to COP 1 
LWXC1 Load word indexed to COP 1 
MADD.fmt Floating-point multiply-add 
MOVF Move conditional on FP condition code false 
MOVN Move on register not equal to zero 
MOVT Move conditional on FP condition code true 
MOVZ Move on register equal to zero 
MOVF.fmt FP move conditional on condition code false 
MOVN.fmt FP move on register not equal to zero 
MOVT.fmt FP move conditional on condition code true 
MOVZ.fmt FP move conditional on register equal to zero 
MSUB.fmt Floating-point multiply-subtract 
NMADD.fmt Floating-point negative multiply-add 
NMSUB.fmt Floating-point negative multiply-subtract 
PREFX Prefetch indexed — register + register 
PREF Prefetch — register + offset 
RECIP.fmt Reciprocal 
RSQRT.fmt Reciprocal square root 
SDXCl1 Store doubleword indexed from COP1 
SWXCl Store word indexed from COP1 
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16.8.2 Implementation-Specific Instruction Descriptions 


This section describes the new instructions listed in Table 16-7. 


16.8.2.1 Branch on floating-point Coprocessor instructions 
BCIT: Branch on FP condition True 
BCI1F: Branch on FP condition False 
BCITL: Branch on FP condition True Likely 
BCIFL: Branch on FP condition False Likely 


The four Branch instructions are upwardly compatible extensions of the Branch 
on floating-point coprocessor instructions of the MIPS instruction set. The BC1T 
and BC1F instructions are extensions of MIPS I. BCITL and BCIFL are 
extensions of MIPS III. These instructions test one of eight floating-point 
condition codes. This encoding is upwardly compatible with previous MIPS 
architectures. 


The branch target address is computed from the sum of the address of the 
instruction in the delay slot and the 16-bit offset, shifted left two bits and sign 
extended to 64 bits. If the contents of the floating-point condition code specified 
in the instruction are equal to the test value, the target address is branched to with 
a delay of one instruction. If the conditional branch is not taken and the nullify 
delay bit in the instruction is set, the instruction in the branch delay slot is 
nullified. 


16.8.2.2 Floating-point Compare instructions 
C.cond.fmt: Compares the contents of two FPU registers 


The contents of the two FPU source registers specified in the instruction are 
interpreted and arithmetically compared. A result is determined based on the 
comparison and the conditions specified in the instruction. 
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16.8.2.4 
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Indexed floating-point Load instructions 
LWXC1: Load word indexed to Coprocessor | 
LDXC1: Load doubleword indexed to Coprocessor 1 


The two indexed floating-point Load instructions are exclusive to the MIPS IV 
instruction set and transfer floating-point data types from memory to the floating- 
point registers using the register + register addressing mode. There are no indexed 
loads to general-purpose registers. The contents of the general-purpose register 
specified by the base are added to the contents of the general-purpose register 
specified by the index to form a virtual address. The contents of the word or 
doubleword specified by the effective address are loaded into the floating-point 
register specified in the instruction. 


The region bits (63:62) of the effective address must be supplied by the base. If 
the addition alters these bits, an Address Error exception occurs. Also, if the 
address is not aligned, an Address Error exception occurs. 


Integer conditional Move instructions 

MOVT: Move conditional on condition code True 
MOVF: Move conditional on condition code False 
MOVN: Move conditional on register not equal to zero 
MOVZ: Move conditional on register equal to zero 


The four-integer Move instructions are exclusive to the MIPS IV instruction set 
and are used to test a condition code or a general-purpose register and then 
conditionally perform an integer move. The value of the floating-point condition 
code specified in the instruction by the 3-bit condition code specifier, or the value 
of the register indicated by the 5-bit general-purpose register specifier, is 
compared to zero. If the result indicates that the move should be performed, the 
contents of the specified source register are copied into the specified destination 
register. 
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16.8.2.5 Floating-point Multiply-Add instructions 
MADD: Floating-point Multiply-Add 
MSUB: Floating-point Multiply-Subtract 
NMADD: Floating-point Negative Multiply-Add 
NMSUB: Floating-point Negative Multiply-Subtract 


These four instructions are exclusive to the MIPS IV instruction set and 
accomplish two floating-point operations with one instruction. Each of these four 
instructions performs intermediate rounding. 


16.8.2.6 Floating-point conditional Move instructions 
MOVT.fmt: Floating-point conditional move on condition code True 
MOVF.fmt: Floating-point conditional move on condition code False 
MOVN.fmt: Floating-point conditional move on register not equal to zero 
MOVZ.fmt: Floating-point conditional move on register equal to zero 


The four floating-point Conditional Move instructions are exclusive to the MIPS 
IV instruction set and are used to test a condition code or a general-purpose 
register and then conditionally perform a floating-point move. The value of the 
floating-point condition code specified by the 3-bit condition code specifier, or the 
value of the register indicated by the 5-bit general-purpose register specifier, is 
compared to zero. If the result indicates that the move should be performed, the 
contents of the specified source register are copied into the specified destination 
register. All of these conditional floating-point move operations are non- 
arithmetic. Consequently, no IEEE-754 exceptions occur as a result of these 
instructions. 


Vr5432 Microprocessor User’s Manual 335, 


Chapter 16 


16.8.2.7 Prefetch instructions 
PREF: Register + offset format 
PREF: Register + register format 


The two Prefetch instructions are exclusive to the MIPS IV instruction set and 
allow the compiler to issue instructions early so the corresponding data can be 
fetched and placed as close as possible to the CPU. Each instruction contains a 
5-bit “hint” field that gives the coherency status of the line being prefetched. The 
line can be shared, exclusive clean, or exclusive dirty. The contents of the general- 
purpose register specified by the base are added either to the 16-bit sign-extended 
offset or to the contents of the general-purpose register specified by the index to 
form a virtual address. This address and “hint” field are sent to the cache controller 
and a memory access is initiated. 


The region bits (63:62) of the effective address must be supplied by the base. If 
the addition alters these bits, an Address Error exception occurs. The Prefetch 
instruction never generates TLB-related exceptions. The PREF instruction is 
considered a standard processor instruction, while the PREFX instruction is 
considered a standard Coprocessor | instruction. 


16.8.2.8 Reciprocal instructions 
RECIP.fmt: Reciprocal 
RSQRT.fmt: Reciprocal Square Root 


The Reciprocal instruction performs a reciprocal on a floating-point value. The 
reciprocal of the value in the floating-point source register is placed in a 
destination register. 


The Reciprocal Square Root instruction performs a reciprocal square root on a 
floating-point value. The reciprocal of the positive square root of a value in the 
floating-point source register is placed in a destination register. 


The VR5432 meets full IEEE accuracy requirements for the RECIP and RSQRT 
instructions. On the VR5432 microprocessor, the RECIP instruction has the same 
latency as a DIV instruction, but an RSQRT is faster than a SQRT followed by a 
RECIP. 


336 Vr5432 Microprocessor User’s Manual 


Instruction Set Overview 


16.8.2.9 Indexed floating-point Store instructions 
SWXC1: Store word indexed from Coprocessor 1 
SDXC1: Store doubleword indexed from Coprocessor 1 


The two indexed floating-point Store instructions are exclusive to the MIPS IV 
instruction set and transfer floating-point data types from the floating-point 
registers to memory using the register + register addressing mode. There are no 
indexed stores from general-purpose registers. The contents of the general- 
purpose register specified by the base are added to the contents of the general- 
purpose register specified by the index to form a virtual address. The contents of 
the floating-point register specified in the instruction are stored to the memory 
location specified by the effective address. 


The region bits (63:62) of the effective address must be supplied by the base. If 
the addition alters these bits, an Address Error exception occurs. Also, if the 
address is not aligned, an Address Error exception occurs. 


16.9 Integer Rotate Instructions 


The VR5432 processor adds a set of Rotate instructions that are not part of the 
standard MIPS instruction set. 


Table 16-8 Rotate Instruction 


Instruction Definition 

DROR Doubleword rotate right 
DROR32 Doubleword rotate right plus 32 
DRORV Doubleword rotate right variable 
ROR Rotate right 

RORV Rotate right variable 
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16.10 Integer Multiply-Accumulate Instructions 


The VR5432 processor includes a set of Multiply-Accumulate instructions that are 
not part of the standard MIPS instruction set. These instructions use half of the HI 
and LO registers together as a 64-bit accumulator, with the upper 32 bits of the 
accumulator mapped to the lower 32 bits of HI and the lower 32 bits of the 
accumulator mapped to the lower 32 bits of LO. These instructions perform no 
underflow or overflow detection and produce no exceptions. Table 16-9 lists these 


instructions. 


Table 16-9 Multiply-Accumulate Instruction Set Extensions 
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Instruction Definition 

MACC Multiply, accumulate, and move LO 

MACCHI Multiply, accumulate, and move HI 

MACCHIU Unsigned multiply, accumulate, and move HI 
MACCU Unsigned multiply, accumulate, and move LO 
MSAC Multiply, negate, accumulate, and move LO 
MSACHI Multiply, negate, accumulate, and move HI 
MSACHIU Unsigned multiply, negate, accumulate, and move HI 
MSACU Unsigned multiply, negate, accumulate, and move LO 
MUL Multiply and move LO 

MULHI Multiply and move HI 

MULHIU Unsigned multiply and move HI 

MULS Multiply, negate, and move LO 

MULSHI Multiply, negate, and move HI 

MULSHIU Unsigned multiply, negate, and move HI 

MULSU Unsigned multiply, negate, and move LO 

MULU Unsigned multiply and move LO 
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Table 16-10 Multiply-Accumulate Instruction Latency and Repeat Rat e 


Instruction Latency Repeat Rate 
MACC, MACCHI, MACCHIU, MACCU 3 1 
MSAC, MSACHI, MSACHIU, MSACU 3 1 
MUL, MULHI, MULHIU, MULU 3 1 
MULS, MULSHI, MULSHIU, MULSU 3 1 


Multimedia Extensions 


The VR5432 adds a set of instructions to operate on packed vectors of eight 8-bit 
unsigned integers. These instructions are described in Chapter 19. 


Table 16-11 Multimedia Extensions 
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Instruction Definition 
ADD.OB Vector add 
ALNI.OB Vector align 
AND.OB Vector AND 
C.EQ.OB Vector compare equal 
C.LE.OB Vector compare less than or equal 
C.LT.OB Vector compare less than 
MAX.OB Vector maximum 
MIN.OB Vector minimum 
MUL.OB Vector multiply 
MULA.OB Vector multiply-accumulate 
MULS.OB Vector multiply, negate, and accumulate 
MULSL.OB Vector multiply, negate, and load accumulator 
NOR.OB Vector NOR 
OR.OB Vector OR 
PICKF.OB Vector pick false 
PICKT.OB Vector pick true 
RZU.OB Vector scale, round, and clamp accumulator 
SHFL.MIXH.OB Vector element shuffle 
SHFL.MIXL.OB Vector element shuffle 
SHFL.PACH.OB Vector element shuffle 
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Table 16-11 Multimedia Extensions (continued) 


Instruction Definition 
SHFL.PACL.OB Vector element shuffle 
SLL.OB Vector shift left logical 
SRL.OB Vector shift right logical 
SUB.OB Vector subtract 
XOR.OB Vector XOR 


Debugging Instructions 


The VR5432 processor adds a set of instructions to control the on-chip debugging 
features described in Chapter 20. 


Table 16-12 Debug Instruction 


Instruction Definition 

DBREAK Debug break 

DRET Debug return 

MFDR Move from Debug register 
MTDR Move to Debug register 


Instruction Notation Conventions 


In the following instruction set chapters, all variable subfields in instruction 
formats (such as rs, rt, fs, ft, immediate, and so on) are shown in lowercase. 


For clarity, sometimes an alias is used for a variable subfield in the formats of 
specific instructions. For example, rs = base in the format for Load and Store 
instructions. Such an alias is always lowercase, since it refers to a variable 
subfield. 


In some instructions, the instruction subfields op and function have fixed 6-bit 
values. These instructions use an uppercase mnemonic. For instance, in the 
floating-point ADD instruction, op = COP1 and function = FADD. In other cases, 
a single field has both fixed and variable subfields, so the name contains both 
uppercase and lowercase characters. The actual encodings of all the mnemonics 
and the codes in the function fields are shown in the instruction chapters. The 
operation executed by each instruction is described in pseudocode notation, as 
described in Table 16-13. 
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Table 16-13 Instruction Operation Notatio n 


Symbol Meaning 

- Substitution assignment 

| Bit string concatenation 

x’ Repetition of bit string x with a y-bit string. x is always a 
single-bit value. 
Selection of bits y through z for bit string x. Little-endian bit 

Xy...Z notation is always used. If y is less than z, this expression is 
an empty (zero length) bit string. 

+ Two’s-complement or floating-point addition 

- Two’s-complement or floating-point subtraction 

i Two’s-complement or floating-point multiplication 

div Two’s-complement integer division 

mod Two’s-complement remainder 

/ Floating-point division 

< Two’s-complement less than comparison 

and Bitwise logical AND 

or Bitwise logical OR 

xor Bitwise logical XOR 

nor Bitwise logical NOR 

GPR[x] General-purpose register x. GPR (0) always reads as zero. 
Attempts to modify the contents of GPR (0) have no effect. 

CPR[z,x] Coprocessor unit z, general-purpose register x 

CCR[z,x] Coprocessor unit z, control register x 

COC[z] Coprocessor unit z condition signal 
Endian mode as configured at reset (0 — Little, 1 — Big). 
Specifies the byte order of the memory interface (see 

‘ : LoadMemory and StoreMemory), and the byte order of 

Biernaenhen Kernel and Supervisor modes. Controlled by the BE bit in 
the Configuration register, which can only be modified 
during reset initialization. 
Signal to reverse the byte order of Load and Store 

ReverseEndian instructions. This feature is available in User mode only, 
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Table 16-13 Instruction Operation Notations (continued) 


Symbol 


Meaning 


BigEndianCPU 


Endian mode for Load and Store instructions (0 > Little, 1 
— Big). 

In User mode, byte order can be reversed by setting the RE 
bit. The byte order is also affected by the BE bit in the 
Configuration register. BigEndianCPU is calculated as 
BigEndianMem XOR ReverseEndian. 


LLbit 


Bit showing synchronized state of instructions. Set by LL 
instruction, cleared by ERET instruction, and read by SC 
instruction. 


T+i: 


Indicates the time steps between operations. Each statement 
within a time step is defined to be executed in sequential 
order (instruction execution order may be changed by 
conditional branch and loop). Operations marked T + i: are 
executed at instruction cycle i from the start of execution of 
the instruction. Thus, an instruction that starts at time j 
executes operations marked T + i: at the time of the 7 + jth 
cycle. The order is not defined for instructions executed at 
the same time of operations. 
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The examples in Figure 16-2 illustrate the application of some of the instruction 
notations. 


Example #1 
GPRI[rt] < immediate || 016 


Sixteen zero bits are concatenated with an immediate value (typically 16 bits) and the 
32-bit string is assigned to general-purpose register rt. 


Example #2 
(immediate 5)'® || immediate;5 9 


Bit 15 (the sign bit) of an immediate value is extended for 16 bit positions, and th 
result is concatenated with bits 15 through 0 of the immediate value to form a 32-bit 
sign-extended value. 


Example #3 
CPR[1, ft] <— data 


Data is assigned to general-purpose register ft of CP1 (Floating-Point 
General-Purpose register FGR). 


Figure 16-2 Instruction Notation Examples 
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17.1 Introduction 


This chapter describes the instruction set architecture (ISA) for the central 
processing unit (CPU) in the MIPS IV architecture. (For a general overview of the 
VR5432 MIPS IV instruction set, see Chapter 16.) The CPU architecture defines 
the nonprivileged instructions that execute in User mode. It does not define 
privileged instructions providing processor control executed by the 
implementation-specific system control processor. Instructions for the floating- 
point unit are described in Chapter 18. 


|g iP Functional Instruction Groups 


CPU instructions are divided into the following functional instruction groups: 


Load and Store 

Arithmetic and Logic Unit (ALU) 
Jump and Branch 

Miscellaneous 


Coprocessor 
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Load and Store Instructions 


The instructions in Table 17-1 transfer data in bytes, halfwords, words, and 
doublewords. Signed and unsigned integers of different sizes are supported by 
load operations that either sign extend or zero extend the data loaded into the 
register. Load and Store instructions are not defined for CPO; the Move to/from 
coprocessor instructions provide the only way to write and read the CPO registers. 


Table 17-1 Normal CPU Load/Store Instructions 


Mnemonic Description Defined in MIPS... 
LB Load Byte I 
LBU Load Byte Unsigned I 
SB Store Byte I 
LH Load Halfword I 
LHU Load Halfword Unsigned I 
SH Store Halfword I 
LW Load Word I 
LWU Load Word Unsigned Il 
SW Store Word I 
LD Load Doubleword Il 
SD Store Doubleword Il 


Unaligned words and doublewords can be loaded or stored in only two 
instructions by using a pair of special instructions (Table 17-2). The Load 
instructions read the left-side or right-side bytes (left or right side of the register) 
from an aligned word and merge them into the correct bytes of the destination 
register. MIPS I, though it prohibits other use of loaded data in the load delay slot, 
permits LWL and LWR instructions targeting the same destination register to be 
executed sequentially. Store instructions select the correct bytes from a source 
register and update only those bytes in an aligned memory word (or doubleword). 
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Table 17-2. Unaligned CPU Load/Store Instructions 


Mnemonic Description Defined in MIPS... 
LWL Load Word Left I 
LWR Load Word Right I 
SWL Store Word Left I 
SWR Store Word Right I 
LDL Load Doubleword Left Il 
LDR Load Doubleword Right Il 
SDL Store Doubleword Left Il 
SDR Store Doubleword Right Il 
2A Atomic update Load and Store instructions 


Paired instructions, Load Linked and Store Conditional, can be used to perform an 
atomic read-modify-write access of word and doubleword cached memory 
locations. These instructions are used in carefully coded sequences to provide one 
of several synchronization primitives, including test-and-set, bit-level locks, 
semaphores, and sequencers/event counts. The individual instruction descriptions 
describe how to use them. 


Table 17-3 Atomic Update CPU Load/Store Instructions 


Mnemonic Description aa 
LL Load Linked Word II 
SC Store Conditional Word II 
LLD Load Linked Doubleword Il 
SCD Store Conditional Doubleword Il 
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V2 Computational Instructions 


L7s2251 Multiply and Divide instructions 


The Multiply and Divide instructions produce twice as many result bits as is 
typical with other processors and they deliver their results into the HI and LO 
special registers. Multiply produces a full-width product twice the width of the 
input operands; the low half is put in LO and the high half is put in HI. Divide 
produces both a quotient in LO and a remainder in HI. The results are accessed by 
instructions that transfer data between HI/LO and the general-purpose registers. 


Table 17-4 Multiply/Divide Instructions 


348 


Mnemonic Description Defined in MIPS... 
MULT Multiply Word I 
MULTU Multiply Unsigned Word I 
DIV Divide Word I 
DIVU Divide Unsigned Word I 
DMULT Doubleword Multiply Il 
DMULTU Doubleword Multiply Unsigned Il 
DDIV Doubleword Divide Il 
DDIVU Doubleword Divide Unsigned Il 
MFHI Move From HI I 
MTHI Move To HI I 
MFLO Move From LO I 
MTLO Move To LO I 
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Cycle Timing for Computational Instructions 


The Vr5432A processor performs most computational instructions with the 
exception of Multiply and Divide instructions in a single processor cycle 
(PCycle). Multiply and Divide instructions require multiple iterations in the 
functional units and require multiple processor cycles to execute. Also, Divide and 
some Multiply instructions require the use of the MFLO and MFHI instructions to 
move the result back to the general register file. Since Multiply and Divide 
instructions can be executed in parallel with other nondependent instructions, it is 
desirable to schedule nondependent operations to gain performance. The 
VR5432A will automatically interlock the pipe when a dependency on a 
multicycle instruction is detected. 


Table 17-5 gives the number of processor cycles (PCycles) required to execute 
and resolve a stall between Multiply or Divide instructions, and a subsequent 
dependent instruction. 


Table 17-5 Multiply and Divide Instruction Latency and Repeat Rates 


Latency '/Repeat Rate 
Instruction (Cycles)/(Cycles) 

Word Long 
DIV / DIVU / DDIV / DDIVU 42/42 74/74 
MACC / MACCHI / MACCHIU / MACCU 3/1 
MSAC / MSACHI / MSACHIU / MSACU 3/1 
MUL / MULHI / MULHIU / MULU 3/1 
MULS / MULSHI/ MULSHIU / MULSU 3/1 
MULT / MULTU / DMULT / DMULTU 3/1 4/2 
Note: 


1. Latency of the accumulator for back-to-back Multiply-accumulate instructions is 1 cycle. 
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ALU instructions 


Some Arithmetic and Logical instructions operate on one operand from a register 
and the other from a 16-bit immediate value in the instruction word. The 
immediate operand is treated as signed for the Arithmetic and Compare 
instructions, and as logical (zero extended to register length) for the Logical 


instructions. 


Table 17-6 ALU Instructions With an Immediate Op eran 


Mnemonic Description Defined in MIPS... 
ADDI Add Immediate Word I 
ADDIU Add Immediate Unsigned Word I 
SLTI Set on Less Than Immediate I 
SLTIU Set on Less Than Immediate Unsigned I 
ANDI AND Immediate I 
ORI OR Immediate I 
XORI Exclusive OR Immediate I 
LUI Load Upper Immediate I 
DADDI Doubleword Add Immediate Il 
DADDIU Doubleword Add Immediate Unsigned Il 
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Table 17-7 Three-Operand ALU Instructions 
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Mnemonic Description point 
ADD Add Word I 
ADDU Add Unsigned Word I 
SUB Subtract Word I 
SUBU Subtract Unsigned Word I 
DADD Doubleword Add net 
DADDU Doubleword Add Unsigned Ii 
DSUB Doubleword Subtract net 
DSUBU Doubleword Subtract Unsigned net 
SLT Set on Less Than I 
SLTU Set on Less Than Unsigned I 
AND AND I 
OR OR I 
XOR Exclusive OR I 
NOR NOR I 
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Shift instructions 


There are Shift instructions that take the shift amount from a 5-bit field in the 
instruction word and Shift instructions that take a shift amount from the low-order 
bits of a general-purpose register. The instructions with a fixed shift amount are 
limited to a 5-bit shift count, so there are separate instructions for doubleword 
shifts of 0-31 bits and 32-63 bits. 


Table 17-8 Shift Instructions 


Mnemonic Description Defined in MIPS... 
SLL Shift Word Left Logical I 
SRL Shift Word Right Logical I 
SRA Shift Word Right Arithmetic I 
SLLV Shift Word Left Logical Variable I 
SRLV Shift Word Right Logical Variable I 
SRAV Shift Word Right Arithmetic Variable | I 
DSLL Doubleword Shift Left Logical Il 
DSRL Doubleword Shift Right Logical Il 
DSRA Doubleword Shift Right Arithmetic | TI 
DSLL32 Doubleword Shift Left Logical + 32 | HI 
DSRL32 Doubleword Shift Right Logical + 32 | HI 
DSRA32 ne Shift Right Arithmetic + I 
DSLLV mee Shift Left Logical W 
DSRLV Doubleword Shift Right Logical ll 

Variable 
DSRAV eee. Shift Right Arithmetic W 
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Table 17-9 Jump Instructions Jumping Within a 256 MB Regio 


Mnemonic Description Defined in MIPS... 
J Jump I 
JAL Jump and Link I 


Table 17-10 Jump Instructions to Absolute Address 


Mnemonic Description Defined in MIPS... 
JR Jump Register I 
JALR Jump and Link Register I 


Table 17-11 PC-Relative Conditional Branches Comparing Two Registers 


Mnemonic Description ins 
BEQ Branch on Equal I 
BNE Branch on Not Equal I 
BLEZ Branch on Less Than or Equal to Zero I 
BGTZ Branch on Greater Than Zero I 
BEQL Branch on Equal Likely II 
BNEL Branch on Not Equal Likely II 
BLEZL Branch on Less Than or Equal to Zero Likely | II 
BGTZL Branch on Greater Than Zero Likely II 
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Table 17-12 PC-Relative Conditional Branches Comparing Against Zer 


: rer Defined in 
Mnemonic Description MIPS... 
BLTZ Branch on Less Than Zero I 
BGEZ Branch on Greater Than or Equal to Zero I 
BLTZAL | Branch on Less Than Zero and Link I 
BGEZAL | Branch on Greater Than or Equal to Zero and Link | I 
BLTZL Branch on Less Than Zero Likely II 
BGEZL Branch on Greater Than or Equal to Zero Likely _| II 
BLTZALL | Branch on Less Than Zero and Link Likely II 
BGEZALL Branch on Greater Than or Equal to Zero and Link Il 
Likely 
17.2.4 Miscellaneous Instructions 
17.2.4.1 Exception instructions 


Exception instructions cause exceptions that will transfer control to a software 
exception handler in the kernel. System Call and Breakpoint instructions cause 
exceptions unconditionally. Trap instructions cause exceptions based upon the 
result of a comparison. 


Table 17-13 System Call and Breakpoint Instructions 
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Mnemonic Description Defined in MIPS... 
SYSCALL System Call I 
BREAK Breakpoint I 
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Table 17-14 Trap-on-Condition Instructions Comparing Two Register 


Mnemonic Description Defined in MIPS... 
TGE Trap if Greater Than or Equal II 
TGEU Trap if Greater Than or Equal Unsigned __| II 
TLT Trap if Less Than II 
TLT Trap if Less Than Unsigned II 
TEQ Trap if Equal II 
TNE Trap if Not Equal II 


Table 17-15 Trap-on-Condition Instructions Comparing an Immediate 


Mnemonic Description peers a 
TGEI Trap if Greater Than or Equal Immediate II 
TGEIU Trap if Greater Than or Equal Unsigned Immediate | IT 
TLTI Trap if Less Than Immediate II 
TLTIU Trap if Less Than Unsigned Immediate II 
TEQI Trap if Equal Immediate II 
TNEI Trap if Not Equal Immediate II 
17.2.4.2 Conditional Move instructions 


Instructions were added in MIPS IV to move one CPU general-purpose register to 
another, based on the value in a third general-purpose register. 


Table 17-16 CPU Conditional Move Instructio n 


: ee Defined in 
Mnemonic Description MIPS... 
MOVN Move Conditional on Not Zero IV 
MOVZ Move Conditional on Zero IV 
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System Control Coprocessor (CP0) Instructions 


There are some limitations imposed on operations involving a CPO that is 
incorporated within the CPU. Although Load and Store instructions to transfer 
data to and from coprocessors and to exchange control codes to and from 
coprocessor instructions are generally permitted by the MIPS architecture, CPO is 
given a somewhat protected status because it has responsibility for exception 
handling and memory management. Therefore, the coprocessor transfer 
instructions are the only valid way of writing to and reading from the CPO 
registers. 


Some CPO instructions are defined to directly read, write, and probe TLB entries 
and to change the operating modes in preparation for restoring to User mode or 
interrupt-enabled states. 


CPU Instructions 


This section describes in detail each function of the CPU instructions in 32- or 64- 
bit mode. Exceptions that may occur are listed at the end of each instruction’s 
description. For details regarding CPU exceptions and exception processing, refer 
to Chapter 6. 
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ADD Add ADD 
31 26 25 21 20 16 15 11 10 6 5 0 
SPECIAL rs rt rd 0 ADD 
000000 00000 100000 
6 5 5 5 5 6 
Format: 
ADD rd, rs, rt (MIPS I format) 
Description: 
The contents of general-purpose register rs are added to the contents of general- 
purpose register rt. The result is stored in general-purpose register rd. In 64-bit 
mode, the operands must be sign-extended, 32-bit values. 
An Integer Overflow exception occurs if the carries-out of bits 30 and 31 differ 
(two’s-complement overflow). The contents of destination register rd are not 
modified when an Integer Overflow exception occurs. 
Operation: 
32 TT: GPR{[rd] < GPRirs] + GPR[rt] 
64 T: temp < GPR[rs] + GPRirt] 
GPRI[rd] < (tempsi)** || temps}...0 
Exceptions: 


Integer Overflow exception 
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ADDI 


Add Immediate A D D [ 


31 26 25 21 20 16 15 0 
ADDI rs rt immediate 
001000 
6 5 5 16 

Format: 
ADDI rt, rs, immediate (MIPS I format) 

Description: 
The 16-bit immediate is sign extended and added to the contents of general- 
purpose register rs. The result is stored in general-purpose register rt. In 64-bit 
mode, the operand must be sign-extended, 32-bit values. 
An Integer Overflow exception occurs if the carries-out of bits 30 and 31 differ 
(two’s-complement overflow). The contents of destination register rt are not 
modified when an Integer Overflow exception occurs. 

Operation: 

32 T: GPR [rt] — GPR[rs] +(immediate,5) '® || immediate;s,_ 0 

64 T: temp < GPR[rs] + (immediate 45)*8 || immediate15. 9 

GPR[rt] < (temp31)** || tempsi...0 
Exceptions: 
Integer Overflow exception 
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Add Immediate Unsigned ADDI U 


31 


26 25 21 20 16 15 0 


ADDIU 
00100 


1 


rs rt immediate 


6 


5 5 16 


Format: 


ADDIU rt, rs, immediate (MIPS I format) 


Description: 


The 16-bit immediate is sign extended and added to the contents of general- 
purpose register rs. The result is stored in general-purpose register rt. No Integer 
Overflow exception occurs under any circumstance. In 64-bit mode, the operand 
must be sign-extended, 32-bit values. 


The only difference between this instruction and the ADDI instruction is that the 
ADDIU instruction never causes an Integer Overflow exception. 


Operation: 


32 


64 


T 


T: 


: GPR [rt] — GPR[rs] + (immediate,5) '© || immediate;s5 0 


temp < GPR[rs] + (immediate ;5)*8 || immediate;5 6 
GPR{[rt] < (temp31)* || tempsy_..o 


Exceptions: 


None 
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ADDU 


Add Unsigned ADDU 


31 26 25 21 20 16 15 11 10 6 5 0 
SPECIAL rs rt rd 0 ADDU 
000000 00000 100001 
6 5 5 5 5 6 

Format: 
ADDU rd, rs, rt (MIPS I format) 

Description: 
The contents of general-purpose register rs are added to the contents of general- 
purpose register rt. The result is stored in general-purpose register rd. No Integer 
Overflow exception occurs under any circumstance. In 64-bit mode, the operands 
must be sign-extended, 32-bit values. 
The only difference between this instruction and the ADD instruction is that the 
ADDU instruction never causes an Integer Overflow exception. 

Operation: 

32 T: GPR[rd] <GPR[rs] + GPR[rt] 

64 T: temp < GPR[rs] + GPR[rt] 

GPRI[rd] < (tempgi)** || tempsi...0 

Exceptions: 

None 
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AND AND AND 


31 26 25 21 20 16 15 11 10 6 5 0 
SPECIAL rs rt rd 0 AND 
000000 00000 100100 
6 5 5 5 5 6 
Format: 
AND rd, rs, rt (MIPS I format) 
Description: 


The contents of general-purpose register rs are bitwise ANDed with the contents 
of general-purpose register rt. The result is stored in general-purpose register rd. 


Operation: 


32 T: GPRi[rd] < GPR[rs] and GPRirt] 


64 TT:  GPRird] < GPR[rs] and GPR{[rt] 


Exceptions: 


None 
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ANDI AND Immediate ANDI 


31 26 25 21 20 16 15 0 
ANDI rs rt immediate 
001100 
6 5 5 16 
Format: 
ANDI rt, rs, immediate (MIPS I format) 
Description: 


The 16-bit immediate is zero extended and bitwise ANDed with the contents of 
general-purpose register rs. The result is stored in general-purpose register rt. 


Operation: 


32 T: GPRIrt] — 0'° || (immediate and GPRIrs]1s...0) 


64. T:  GPRIirt] — 0*8|| (immediate and GPRIrsh15._.0) 


Exceptions: 


None 
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B EQ Branch on Equal B EQ 


31 26 25 21 20 16 15 0 


rs rt offset 


6 5 5 16 


Format: 
BEQ rs, rt, offset (MIPS I format) 


Description: 


A branch address is calculated from the sum of the address of the instruction in the 
delay slot and the 16-bit offset, shifted two bits left and sign extended. The 
contents of general-purpose register rs and the contents of general-purpose 
register rf are compared. If the two registers are equal, then the program branches 
to the branch address with a delay of one instruction. 


Operation: 


32 TT: target < (offset,s)'* || offset || 0° 
condition <— (GPR[rs] = GPR[rt]) 
T+1: if condition then 
PC < PC + target 
endif 
64 T: target < (offset;;)* || offset || 07 
condition <— (GPR[rs] = GPR[rt]) 
T+1: if condition then 
PC < PC + target 
endif 


Exceptions: 


None 
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B EQ L Branch on Equal Likely B E Q L 


31 26 25 21 20 16 15 0 
BEQL rs rt offset 
010100 
6 5 5 16 
Format: 
BEQL rs, rt, offset (MIPS II format) 
Description: 


A branch address is calculated from the sum of the address of the instruction in the 
delay slot and the 16-bit offset, shifted two bits left and sign extended. The 
contents of general-purpose register rs and the contents of general-purpose 
register rt are compared. If the two registers are equal, the program branches to 
the branch address with a delay of one instruction. 


If it does not branch, the instruction in the delay slot is discarded. 


Operation: 


32 TT: target < (offset;s)'* || offset || 0° 
condition — (GPR[rs] = GPR[rt]) 
T+1: if condition then 
PC < PC + target 


NullifyCurrentInstruction 


else 


endif 
64 T: target < (offset,s)“° || offset || 0° 
condition < (GPR[rs] = GPRIrt]) 
T+1: if condition then 


PC < PC + target 
else 

NullifyCurrentinstruction 
endif 


Exceptions: 


None 
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BGEZ 


31 26 25 21 20 16 15 0 
REGIMM rs BGEZ offset 
000001 00001 
6 5 5 16 
Format: 
BGEZ rs, offset (MIPS I format) 
Description: 


A branch address is calculated from the sum of the address of the instruction in the 
delay slot and the 16-bit offset, shifted two bits left and sign extended. If the 
contents of general-purpose register rs are equal to or greater than 0, then the 
program branches to the branch address with a delay of one instruction. 


Operation: 


32 T: target < (offset;s)'* || offset || 0° 


condition <— (GPR[rs]3; = 0) 


T+1: if condition then 


PC < PC + target 
endif 
target < (offset,s)° || offset || 07 
condition <— (GPR[rs]g3 = 0) 


T+1: if condition then 


PC < PC + target 
endif 


Exceptions: 


None 
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BGEZAL 


Branch on Greater Than 
or Equal to Zero and Link 


BGEZAL 


31 26 25 21 20 16 15 0 
REGIMM rs BGEZAL offset 
000001 10001 
6 5 5 16 
Format: 


BGEZAL rs, offset 


Description: 


(MIPS I format) 


A branch address is calculated from the sum of the address of the instruction in the 
delay slot and the 16-bit offset, shifted two bits left and sign extended. 
Unconditionally, the address of the instruction after the delay slot is stored in the 


link register, r3/. If the contents of general-purpose register rs are equal to or 
greater than 0, then the program branches to the branch address, with a delay of 
one instruction. 


Usually, general-purpose register r3/ should not be specified as general-purpose 
register rs, because the contents of rs are overwritten by storing the link address, 


and then it may not be re-executable. An attempt to use r3/ does not cause an 
exception, however. 


Operation: 


32 T: 


T+1: 


64 T: 


T+1: 


target < (offset,s)'* || offset || 07 


condition <— (GPR[rs]3; = 0) 
GPR[31] — PC + 8 
if condition then 

PC < PC + target 
endif 
target < (offset;s)© || offset || 07 
condition — (GPR[rs]g3 = 0) 
GPR[31] — PC + 8 
if condition then 

PC < PC + target 
endif 


Exceptions: 


None 
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BGEZALL 


31 26 25 21 20 16 15 0 
REGIMM rs BGEZALL offset 
000001 10011 
6 5 5 16 
Format: 


BGEZALL rs, offset 


Description: 


(MIPS II format) 


A branch address is calculated from the sum of the address of the instruction in the 
delay slot and the 16-bit offset, shifted two bits left and sign extended. 
Unconditionally, the address of the instruction after the delay slot is stored in the 
link register, r3/. If the contents of general-purpose register rs are equal to or 
greater than 0, then the program branches to the branch address, with a delay of 
one instruction. When it does not branch, the instruction in the delay slot is 
discarded. Usually, general-purpose register r3/ should not be specified as 
general-purpose register rs, because the contents of rs are overwritten by storing 
the link address, and then it may not be re-executable. An attempt to use r3/ does 
not cause an exception, however. 


Operation: 
32 TT: target < (offset;s)'* || offset || 0° 
condition <— (GPR[rs]3; = 0) 
GPR[31] <— PC +8 
T+1: if condition then 
PC < PC + target 
else 
NullifyCurrentinstruction 
endif 
64 T: target < (offset,;)*° || offset || 07 
condition <— (GPR[rs]g3 = 0) 
GPR[31] <— PC +8 
T+1: if condition then 
PC < PC + target 
else 
NullifyCurrentinstruction 
endif 
Exceptions: 
None 


Vr5432 Microprocessor User’s Manual 


367 


Chapter 17 


Branch on Greater 
B G EZ L Than or Equal to Zero Likely BG EZ L 


31 26 25 21 20 16 15 0 
REGIMM rs BGEZL offset 
000001 00011 
6 5 5 16 
Format: 
BGEZL rs, offset (MIPS II format) 
Description: 


A branch address is calculated from the sum of the address of the instruction in the 
delay slot and the 16-bit offset, shifted two bits left and sign extended. If the 
contents of general-purpose register rs are equal to or greater than 0, then the 
program branches to the branch address, with a delay of one instruction. 


If it does not branch, the instruction in the delay slot is discarded. 


Operation: 


32 T: 


T+1: if condition then 


T+1: if condition then 


target < (offset,s)'* || offset || 0° 
condition < (GPR[rs]3; = 0) 


PC < PC + target 
else 


NullifyCurrentInstruction 
endif 


target < (offset;s)“° || offset || 07 
condition < (GPR[rs]g3 = 0) 


PC < PC + target 


NullifyCurrentinstruction 
endif 


Exceptions: 
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BGTZ Branch on Greater Than Zero BGTZ 


31 26 25 21 20 16 15 0 
BGTZ rs 0 offset 
000111 00000 
6 5 5 16 


Format: 
BGTZ rs, offset (MIPS I format) 


Description: 


A branch address is calculated from the sum of the address of the instruction in the 
delay slot and the 16-bit offset, shifted two bits left and sign extended. If the 
contents of general-purpose register rs are greater than 0, then the program 
branches to the branch address, with a delay of one instruction. 


Operation: 


32 TT: target < (offset;s)'* || offset || 0° 
condition < (GPR[rs]31 = 0) and (GPR[rs] # 0°) 
T+1: if condition then 


PC < PC + target 
endif 


64 T: target < (offsetys)** || offset || 07 
condition < (GPR[rs]g3 = 0) and (GPR[rs] + 0°) 
T+1: if condition then 


PC < PC + target 
endif 


Exceptions: 


None 
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BGTZL Than Zero Likely BGTZL 


31 26 25 21 20 16 15 0 
BGTZL rs 0 offset 
010111 00000 
6 5 5 16 
Format: 
BGTZL rs, offset (MIPS II format) 
Description: 
A branch address is calculated from the sum of the address of the instruction in the 
delay slot and the 16-bit offset, shifted two bits left and sign extended. If the 
contents of general-purpose register rs are greater than 0, then the program 
branches to the branch address, with a delay of one instruction. 
If it does not branch, the instruction in the delay slot is discarded. 
Operation: 
32 TT: target < (offset,s)'* || offset || 0° 
condition < (GPR[rs]31 = 0) and (GPR[rs] # 0°) 
T+1: if condition then 
PC < PC + target 
else 
NullifyCurrentinstruction 
endif 
64 T: target < (offset,;)*© || offset || 07 
condition < (GPR[rs]g3 = 0) and (GPR[rs] # 0%) 
T+1: if condition then 
PC < PC + target 
else 
NullifyCurrentinstruction 
endif 
Exceptions: 
None 
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Branch on Less Than 
BLEZ or Equal to Zero BLEZ 
31 26 25 21 20 16 15 0 
BLEZ rs 0 offset 
000110 00000 
6 5 5 16 
Format: 
BLEZ rs, offset (MIPS I format) 
Description: 


A branch address is calculated from the sum of the address of the instruction in the 


delay slot and the 16-bit offset, shifted two bits left and sign extended. If the 


contents of general-purpose register rs are equal to or less than 0, then the program 


branches to the branch address, with a delay of one instruction. 


Operation: 


32 TT: target < (offset;s)'4 || offset || 07 


T+1: if condition then 
PC < PC + target 
endif 


64. T: target < (offset;s)*® || offset || 07 


T+1: if condition then 


PC < PC + target 
endif 


condition < (GPR[rs]s1 = 1) or (GPR[rs] = 0°) 


condition < (GPR[rs]g3 = 1) and (GPR[rs] = 0°) 


Exceptions: 


None 
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BLEZL 


Branch On Less Than 
or Equal to Zero Likely 


BLEZL 


31 26 25 21 20 16 15 0 
BLEZL rs 0 offset 
010110 00000 
6 5 5 16 
Format: 
BLEZL rs, offset (MIPS II format) 
Description: 


A branch address is calculated from the sum of the address of the instruction in the 
delay slot and the 16-bit offset, shifted two bits left and sign extended. If the 
contents of general-purpose register rs are equal to or less than 0, then the program 


branches to the branch address, with a delay of one instruction. 


If it does not branch, the instruction in the branch delay slot is discarded. 


Operation: 


32 T: 


T+1: 


64 T: 


T+1: 


target < (offset;s) '4 || offset || 0° 
condition < (GPR[rs]31 = 1) or (GPR[rs] = 0°") 
if condition then 
PC < PC + target 
else 
NullifyCurrentinstruction 
endif 
target < (offset,s)“° || offset || 0° 
condition < (GPR[rs]g3 = 1) and (GPR[rs] = 0%) 
if condition then 
PC < PC + target 
else 
NullifyCurrentinstruction 
endif 


Exceptions 


None 
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B LTZ Branch on Less Than Zero B LTZ 


31 26 25 21 20 16 15 0 
REGIMM rs BLTZ offset 
000001 00000 
6 5 5 16 
Format: 
BLTZ rs, offset (MIPS I format) 
Description: 


A branch address is calculated from the sum of the address of the instruction in the 
delay slot and the 16-bit offset, shifted two bits left and sign extended. If the 
contents of general-purpose register rs are less than 0, then the program branches 
to the branch address, with a delay of one instruction. 


Operation: 


32 TT: target < (offset;s)'* || offset || 07 
condition <— (GPR[rs]3; = 1) 
T+1: if condition then 
PC < PC + target 
endif 
64 T: target < (offset;;)** || offset || 0° 
condition <— (GPRI[rs]g3 = 1) 
T+1: if condition then 
PC < PC + target 


endif 


Exceptions: 


None 
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BLTZAL 


Branch on Less 


Than Zero and Link 


BLTZAL 


31 26 25 21 20 16 15 0 
REGIMM rs BLTZAL offset 
000001 10000 
6 5 5 16 
Format: 
BLTZAL rs, offset (MIPS I format) 
Description: 
A branch address is calculated from the sum of the address of the instruction in the 
delay slot and the 16-bit offset, shifted two bits left and sign extended. 
Unconditionally, the address of the instruction after the delay slot is stored in the 
link register, r3/. If the contents of general-purpose register rs are less than 0, then 
the program branches to the branch address, with a delay of one instruction. 
Usually, general-purpose register 73/7 should not be specified as general-purpose 
register rs, because the contents of rs are overwritten by storing the link address, 
and then it is not re-executable. An attempt to use r3/ does not generate an 
exception, however. 
Operation: 
32 TT: target < (offset;s)'* || offset || 07 
condition <— (GPR[rs]3; = 1) 
GPR[381] < PC +8 
T+1: if condition then 
PC < PC + target 
endif 
64. T: target < (offset;s)*® || offset || 0° 
condition — (GPR[rs]g3 = 1) 
GPR[31] < PC +8 
T+1: if condition then 
PC < PC + target 
endif 
Exceptions: 
None 
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BLTZALL 


31 26 25 21 20 16 15 0 
REGIMM rs BLTZALL offset 
000001 10010 
6 5 5 16 
Format: 
BLTZALL rs, offset (MIPS II format) 
Description: 


A branch address is calculated from the sum of the address of the instruction in the 
delay slot and the 16-bit offset, shifted two bits left and sign extended. 
Unconditionally, the instruction after the delay slot is stored in the link register, 
r31. If the contents of general-purpose register rs are smaller than 0, then the 
program branches to the branch address, with a delay of one instruction. 


If it does not branch, the instruction in the branch delay slot is discarded. 
Usually, general-purpose register r3/ should not be specified as general-purpose 
register rs, because the contents of rs are overwritten by storing the link address, 


and then it is not re-executable. An attempt to use r3/ does not cause an exception, 


however. 


Operation: 


32 TT: target < (offset;s)'* || offset || 0° 
condition <— (GPR[rs]3; = 1) 
GPR[31] — PC +8 
T+1: if condition then 


PC < PC + target 


eke NullifyCurrentinstruction 


endif 
64 T: target < (offsetys)*® || offset || 07 
condition <— (GPR[rs]g3 = 1) 
GPR[31] — PC +8 
T+1: if condition then 


PC < PC + target 


eRe NullifyCurrentinstruction 


endif 


Exceptions: 


None 
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BLTZL 


Branch on Less Than Zero Likely 


BLTZL 


31 26 25 21 20 16 15 0 
REGIMM rs BLTZL offset 
000001 00010 
6 5 5 16 
Format: 


BLTZL rs, offset 


Description: 


(MIPS II format) 


A branch address is calculated from the sum of the address of the instruction in the 
delay slot and the 16-bit offset, shifted two bits left and sign extended. 
Unconditionally, the instruction after the delay slot is stored in the link register, 


r31. If the contents of general-purpose register rs are less than 0, then the program 
branches to the branch address, with a delay of one instruction. 


If it does not branch, the instruction in the branch delay slot is discarded. 


Operation: 


32 TT: target < (offset;s)'* || offset || 0° 
condition <— (GPR[rs]3; = 1) 
: if condition then 
PC < PC + target 


NullifyCurrentinstruction 


ndif 
target < (offset,s)“° || offset || 0° 
condition — (GPR[rs]g3 = 1) 
if condition then 
PC < PC + target 


T+1: 


else 
NullifyCurrentinstruction 
endif 


Exceptions: 


None 
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Branch on Not Equal B N E 


31 26 25 21 20 16 15 0 
BNE rs rt offset 
000101 
6 5 5 16 
Format: 
BNE rs, rt, offset (MIPS I format) 
Description: 


A branch address is calculated from the sum of the address of the instruction in the 
delay slot and the 16-bit offset, shifted two bits left and sign extended. The 
contents of general-purpose register rs and the contents of general-purpose 
register rt are compared. If the two registers are not equal, then the program 
branches to the branch address, with a delay of one instruction. 


Operation: 
32 T: target < (offset;s)'* || offset || 07 
condition <— (GPR[rs] # GPR[rt]) 
T+1: if condition then 
PC < PC + target 
endif 
64‘ T: target < (offset;s)*© || offset || 07 
condition <— (GPR[rs]  GPRi[rt]) 
T+1: if condition then 
PC < PC + target 
endif 
Exceptions: 
None 
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BNEL 


Branch on Not Equal Likely 


BNEL 


31 26 25 21 20 16 15 0 
BNEL rs rt offset 
010101 
6 5 5 16 
Format: 


BNEL rs, rt, offset 


Description: 


(MIPS II format) 


A branch address is calculated from the sum of the address of the instruction in the 
delay slot and the 16-bit offset, shifted two bits left and sign extended. The 
contents of general-purpose register rs and the contents of general-purpose 
register rt are compared. If the two registers are not equal, then the program 
branches to the branch address, with a delay of one instruction. 


If it does not branch, the instruction in the branch delay slot is discarded. 


Operation: 


32 T: 


T+1: 


target < (offset;s)'4 || offset || 07 
condition < (GPR[rs]  GPRi[rt]) 


: if condition then 


PC < PC + target 
else 

NullifyCurrentinstruction 
endif 
target < (offset;s)“© || offset || 0° 
condition <— (GPR[rs] # GPRirt]) 
if condition then 

PC < PC + target 
else 

NullifyCurrentinstruction 


endif 


Exceptions 


None 
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BREAK Breakpoint BREAK 


31 26 25 65 0 
SPECIAL code BREAK 
000000 001101 

6 20 6 
Format: 
BREAK (MIPS I format) 
Description: 


A Breakpoint exception occurs after execution of this instruction, transferring 
control to the exception handler. 


The code field is available for transferring parameters to the exception handler. 
The parameter is retrieved by the exception handler only by loading as data the 
contents of the memory word containing the instruction. 


Operation: 


32, 64T: BreakpointException 


Exceptions: 


Breakpoint exception 
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CACH E Cache Operation CACH E 


31 26 25 21 20 16 15 0 
CACHE base op offset 
101111 
6 5 5 16 
Format: 


CACHE op, offset (base) (MIPS ITI format) 


Description: 


380 


The 16-bit offset is sign extended and added to the contents of general-purpose 
register base to form a virtual address. The virtual address is translated to a 
physical address using the TLB, and the 5-bit sub-opcode specifies a cache 
operation for that address. 


The index operation uses part of the virtual address to specify a cache block. 


For a cache of 32 KB with 32 bytes per tag, vAddr 13.5 specifies the block. Bit 0 of 
the virtual address is used to specify the associativity. 


Index Load Tag uses vAddr, pygpirts... 3 to select the doubleword for reading 
parity. When the CE bit of the Status register is set, Hit WriteBack, Hit WriteBack 
Invalidate, Index WriteBack Invalidate, and Fill also use vAddr, tygpits ... 3 to 
select the doubleword that has its parity modified. This operation is performed 
unconditionally. 


The hit operation accesses the specified cache as normal data references and 
performs the specified operation if the cache block contains valid data with the 
specified physical address (a hit). If the cache block is invalid or contains a 
different address (a miss), no operation is performed. 


During a write-back operation, modified data in the cache (i.e., “dirty” data) is 
written to main memory. The address to be written is specified by the cache tag 
and not the translated physical address. 
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Cache Operation 
(continued) CACH E 


Bits 17...16 of the instruction specify the cache as follows: 


Code Name Cache 
00 I Instruction 
01 D Data 
10 — Reserved 
11 — Reserved 


Bits 20 to 18 (this value is listed under the Code column) of the instruction specify 
the operation as follows: 


Code | Cache Name Operation 
Index : 
000 I : Set the cache state of the cache block to Invalid and Unlocked. 
Invalidate 
Examine the cache state of the data cache block at the index 
Index specified by the virtual address. If the state is Dirty and not 
000 D WriteBack | Invalid, writes the block back to memory. The address to write 
Invalidate is taken from the cache tag. Set the cache state of the cache 
block to Invalid. May be used to unlock a cache block. 
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CACHE 


Cache Operation 
(continued) CACH E 


010 


I,D 


Index Store 
Tag 


Code | Cache Name Operation 
Reads the tag for the cache block at the specified index and 
Index Load | places it into the TagLo and TagHi CPO registers, ignoring any 
001 All : ae : wig 
Tag parity errors. In addition, the data parity from the specified 


doubleword is loaded into the PErr register. 


Write the tag for the cache block at the specified index from the 
TagLo and TagHi CPO registers, including the parity bit (P) 
from the TagLo register. 


O11 


382 


Create Dirty 


This operation is used to avoid loading data needlessly from 
memory when writing new contents into an entire cache block. 
If the cache block does not contain the specified address and the 
block is dirty, write it back to memory. In all cases, set the cache 
block tag to the specified physical address and set the cache 
state to Dirty. 
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Cache Operation 
(continued) CACH E 


Code | Cache Name Operation 
Hit If the cache block contains the specified address, mark the cache 
100 I,D : : 
Invalidate block Invalid. 
ELE If the cache block contains the specified address, write the data 
101 D WriteBack wpdace ts : 
‘ back if it is dirty. In all cases, mark the cache block Invalid. 
Invalidate 
101 I Fill Fill the instruction cache block from memory. 
110 D Hit If the cache block contains the specified address and its state is 
WriteBack | Dirty, write back the data and clear the state to not Dirty. 
This operation is used to lock a cache block. If the cache block 
does not contain the specified address, fill it from memory, 
Fetch and i oe : 

111 D Lock writing the original block back to memory using the tag address 
if the block was dirty. In all cases, set the cache block tag to the 
specified physical address and set the cache state to Locked. 
This operation is used to lock a cache block. If the cache block 

Ww I Fetch and does not contain the specified address, fill it from memory. In 

Lock all cases, set the cache block tag to the specified physical 
address and set the cache state to Locked. 


TLB Refill and TLB Invalid exceptions can occur on any operation. For Index 
operations (where the physical address is used to index the cache but need not 
match the cache tag), unmapped addresses may be used to avoid TLB exceptions. 
This operation never causes TLB Modified exceptions. 


If CPO is not enabled (i.e., the CPO enable bit in the Status register is clear in User 
or Supervisor mode) and this instruction is executed, a Coprocessor Unusable 


exception is taken. The operation of this instruction on any operation/cache 


combination not listed in the table is undefined. The operation of this instruction 
on uncached addresses is also undefined. 


The processor only fills the I-cache line using the cache instruction “Fill” when 
the data is not stored in the cache. 
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384 


Operation: 


32,64T:  vAddr < ((offset;s)*® || offsetys,_9) + GPR[base] 
(pAddr, uncached) < AddressTranslation (vAddr, DATA) 
CacheOp (op, vAddr, pAddr) 


Exceptions: 


Coprocessor Unusable exception 
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CFC1 Move Control Word from FPU CFC1 


(Coprocessor 1) 


31 26 25 21 20 16 15 11 10 0 
COP1 CF rt fs 0 
010001 00010 00000000000 
6 5 5 5 11 
Format: 
CFCI1 rt, fs (MIPS I format) 
Description: 


The contents of the floating-point control register fs are loaded into general- 
purpose register rt, with sign extension if the destination register is 64 bits. 


This instruction is only defined when fs equals 0 or 31. 


For MIPS I, MIPS II, and MIPS II, the contents of general-purpose register rt are 
undefined while the instruction immediately following this Load instruction is 
being executed. 


Operation: 


32 T: temp < FCRi[fs] 
T+1: GPR[rt] — temp 


64 T: temp < FCRifs] 
T+1: GPR[rt] — (temp31)°" || temp 


Exceptions: 


Coprocessor Unusable exception 
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CO P Z Coprocessor z Operation CO Pp Z 


31 26 2524 0 
COPz CO cofun 
0100xx 1 
6 1 25 
Format: 
COPz cofun (MIPS I format) 
Description: 


A coprocessor operation is performed. The operation may specify and reference 
internal coprocessor registers, and may change the state of the coprocessor 
condition line, but does not modify states within the processor, cache, or main 
memory. 


Operation: 


32,64T: | CoprocessorOperation (z, cofun) 


Exceptions: 


Coprocessor Unusable exception 
Floating-Point exception (CP1 only) 


Opcode Bit Encoding: 
COPz Bit #31 30 29 28 27 26 25 0 
COPO 
Bit #31 30 29 28 27 26 25 0 
COP1 
Bit #31 30 29 28 27 26 25 0 
COP2 
A ¥ 
Opcode _ Coprocessor Sub-opcode 
— Coprocessor Number 
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Move Control Word to FPU 


(Coprocessor 1) 


CPU Instruction Set 


CTC1 


31 


26 25 


21 20 16 15 


11 10 


CT 
00110 


0 
00000000000 


5 


11 


Format 


Descrip 


CTCI1 rt, fs 


tion: 


(MIPS I format) 


The contents of general-purpose register rt are stored in floating-point control 
register fs. This instruction is defined only if fs is 0 or 31. 


If any cause bit of the Floating-Point Control/Status register (FCR31) and its 


corresponding enable bit are set by writing data to FCR31, the Floating-Point 
exception occurs. The data is written to the register before the exception occurs. 


For MIPS I, MIPS II, and MIPS II, the contents of the Floating-Point Control 
register fs are undefined while the instruction immediately following this 
instruction is executed. 


Operation: 


32 


64 


temp <— GPRi[rt] 
FCR[fs] < temp 
COC[1] — FCR[31]o3 
temp <- GPRIrt]31..0 
FCR[fs] < temp 
COC[1] — FCR[31]o3 
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Move Control Word to FPU 
CTC1 (Coprocessor 1) CTC1 


(continued) 


Exceptions: 


Coprocessor Unusable exception 
Floating-Point exception 


Floating-Point Exceptions: 


Invalid Operation exception 
Unimplemented Operation exception 
Division by Zero exception 

Inexact Operation exception 
Overflow exception 

Underflow exception 
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DADD Doubleword Add DADD 


31 26 25 21 20 16 15 11 10 6 5 0 
SPECIAL rs rt rd 0 DADD 
000000 00000 101100 

6 5 5 5 5 6 
Format: 
DADD rd, rs, rt (MIPS III format) 
Description: 


The contents of general-purpose register rs and the contents of general-purpose 
register rt are added, and the result is stored in general-purpose register rd. An 
Integer Overflow exception occurs if the carries-out of bits 62 and 63 differ 
(two’s-complement overflow). The contents of the destination register rd are not 
modified when an Integer Overflow exception occurs. 


This operation is only defined for 64-bit mode and 32-bit Kernel mode. Execution 
of this instruction in 32-bit User or Supervisor mode causes a Reserved Instruction 
exception. 


Operation: 


64 T:  GPR{[rd] <GPR[rs] + GPRI[rt] 


Note: Same operation in 32-bit Kernel mode. 


Exceptions: 


Integer Overflow exception 
Reserved Instruction exception 
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DADDI Doubleword Add Immediate DADDI 


31 26 25 21 20 16 15 0 
DADDI rs rt immediate 
011000 
6 5 5 16 
Format: 


DADDI rt, rs, immediate (MIPS III format) 


Description: 


The 16-bit immediate is sign extended and added to the contents of general- 
purpose register rs. The result is stored in general-purpose register rt. An Integer 
Overflow exception occurs if the carries-out of bits 62 and 63 differ (two’s- 
complement overflow). The contents of the destination register rt are not modified 
when an Integer Overflow exception occurs. 


This operation is only defined in 64-bit mode and 32-bit Kernel mode. Execution 
of this instruction in 32-bit User or Supervisor mode causes a Reserved Instruction 
exception. 


Operation: 


64 T: GPR [rt] — GPR[rs] + (immediate,5)*° || immediate;5 0 


Note: Same operation in 32-bit Kernel mode. 


Exceptions: 


Integer Overflow exception 
Reserved Instruction exception 
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Doubleword Add 
DADDIU Immediate Unsigned DADDIU 
31 26 25 21 20 16 15 0 


DADDIU 
011001 


rs rt immediate 


6 5 5 16 


Format: 
DADDIU rt, rs, immediate (MIPS III format) 


Description: 


The 16-bit immediate is sign extended and added to the contents of general- 
purpose register rs. The result is stored in general-purpose register rt. 


This operation is only defined in 64-bit mode and in 32-bit Kernel mode. 
Execution of this instruction in 32-bit User or Supervisor mode causes a Reserved 
Instruction exception. 


The only difference between this instruction and the DADDI instruction is that the 
DADDIU instruction never causes an Integer Overflow exception. 


Operation: 


64 T: GPR [rt] — GPR[rs] + (immediate,5)*° || immediate;5 9 


Note: Same operation in 32-bit Kernel mode. 


Exceptions: 


Reserved Instruction exception 
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D AD D U Doubleword Add Unsigned D AD D U 


31 26 25 21 20 16 15 11 10 6 5 0 
SPECIAL rs rt rd 0 DADDU 
000000 00000 101101 
6 5 5 5 5 6 

Format: 
DADDU rd, rs, rt (MIPS III format) 
Description: 


The contents of general-purpose register rs and the contents of general-purpose 
register rt are added, and the result is stored in general-purpose register rd. 


This operation is only defined in 64-bit mode and in 32-bit Kernel mode. 
Execution of this instruction in 32-bit User or Supervisor mode causes a Reserved 
Instruction exception. 


The only difference between this instruction and the DADD instruction is that the 
DADDU instruction never causes an Integer Overflow exception. 


Operation: 


64 TT:  GPR{[rd] <GPR{irs] + GPRI[rt] 


Note: Same operation in the 32-bit Kernel mode. 


Exceptions: 


Reserved Instruction exception 
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DBREAK Debug Break DBREAK 


31 26 25 Gag 0 
SPECIAL2 0 DBREAK 
011100 0O0000000000000000000 411111 
6 20 6 
Format: 
DBREAK (VR5432 format) 
Description: 


The DBREAK instruction forces entry into Debug mode by causing a trap to the 
Debug Exception vector address (OxFFFF FFFF BFCO 1000). This instruction 
may only be executed in User, Supervisor, or Kernel mode. Execution in Debug 
mode results in undefined behavior. 


Execution transitions to Debug mode at an instruction boundary, the program 
counter (PC) is saved in the DEPC register, and execution is redirected to the 64- 
bit Debug Exception vector (location OxFFFF FFFF BFCO 1000). 


Before the processor enters Debug mode, all instructions are flushed from the 
pipeline and all outstanding external bus transactions are completed. There may 
be a delay entering Debug mode to allow the pipeline flush and to allow all 
outstanding external transactions to complete. The processor stalls during this 
time. 


The processor will not enter Debug mode at a branch delay slot instruction 
boundary. Instead, it stops either at the Branch instruction or the target of the 
branch. If a software or hardware breakpoint occurs for the branch delay slot 
instruction, the breakpoint occurs at the corresponding Branch instruction. If a 
single-step break is executed on a Branch instruction, both the branch and its delay 
slot are executed. 


If the DME bit in the Status register is not set, a Reserved Instruction exception 
will occur when DBREAK is issued. 
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DBREAK Debug Break 


(Continued) 


Operation: 


DBREAK 


32,64 T: DBreakOperation () 


Exceptions: 


Reserved Instruction exception 
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Doubleword Divide DDIV 


31 26 25 21 20 16 15 6 5 0 
SPECIAL rs rt 0 DDIV 
000000 0000000000 011110 
6 5 5 10 6 
Format: 
DDIV'rs, rt (MIPS ITI format) 
Description: 


The contents of general-purpose register rs are divided by the contents of general- 
purpose register rf, treating both operands as signed integers. An Integer Overflow 
exception never occurs, and the result of this operation is undefined when the 
divisor is zero. 


This instruction is usually executed after additional instructions to check for a zero 
divisor and for overflow. 


When the operation completes, the quotient word of the doubleword result is 
stored in special register LO, and the remainder word of the doubleword result is 
stored in special register HI. 


If either of the two preceding instructions is MFHI or MFLO, the results of those 
instructions are undefined. To obtain correct results, insert two or more additional 
instructions between MFHI or MFLO and the DDIV instruction. 


This operation is only defined in 64-bit mode and 32-bit Kernel mode. Execution 
in 32-bit User or Supervisor mode causes a Reserved Instruction exception. 


Operation: 
64 T-2: LO < undefined 
HI < undefined 
T-1: LO < undefined 
HI < undefined 
T: LO < GPR[rs] div GPR[rt] 
HI < GPRirs] mod GPR[rt] 
Note: Same operation in 32-bit Kernel mode. 
Exceptions: 


Reserved Instruction exception 
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DDIVU 


Doubleword Divide Unsigned DDIVU 


31 


26 25 21 20 16 15 6 5 0 


SPECIAL 
000000 


rs rt 0 DDIVU 
0000000000 011111 


6 


5 5 10 6 


Format: 


DDIVU rs, rt (MIPS III format) 


Description: 


The contents of general-purpose register rs are divided by the contents of general- 
purpose register rt, treating both operands as unsigned integers. An Integer 
Overflow exception never occurs, and the result of this operation is undefined 
when the divisor is zero. 


This instruction is usually executed after instructions to check for a zero divisor. 


When the operation completes, the quotient (doubleword) is stored into special 
register LO and the remainder (doubleword) is stored into special register HI. 


If either of the two preceding instructions is MFHI or MFLO, the results of those 
instructions are undefined. To obtain correct results, insert two or more 
instructions between MFHI or MFLO and the DDIVU instruction. 


This operation is only defined for the VR5432 operating in 64-bit mode and in 32- 
bit Kernel mode. Execution of this instruction in 32-bit User or Supervisor mode 
causes a Reserved Instruction exception. 


Operation: 


64 


T-2: LO < undefined 
HI < undefined 

T-1: LO < undefined 
HI < undefined 

T: LO < (0 || GPR[rs]) div (0 || GPR[rt]) 
HI < (0 || GPR[rs]) mod (0 || GPR[rt]) 
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Note: Same operation in 32-bit Kernel mode. 


Exceptions: 


Reserved Instruction exception 
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Divide D IV 


31 26 25 21 20 16 15 6 5 0 
SPECIAL rs rt 0 DIV 
000000 0000000000 011010 
6 5 5 10 6 
Format: 
DIV rs, rt (MIPS I format) 
Description: 


The contents of general-purpose register rs are divided by the contents of general- 
purpose register rt, treating both operands as signed integers. An Overflow 
exception never occurs, and the result of this operation is undefined when the 
divisor is zero. In 64-bit mode, the result must be sign-extended, 32-bit values. 


This instruction is usually executed after instructions to check for a zero divisor 
and for overflow 


When the operation completes, the quotient (doubleword) is stored into special 
register LO and the remainder (doubleword) is stored into special register HI. 


If either of the two preceding instructions is MFHI or MFLO, the results of those 
instructions are undefined. To obtain correct results, insert two or more additional 
instructions between MFHI or MFLO and the DIV instructions. 
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DIV Divide DIV 


(continued) 


Operation: 
32 T-2: LO < undefined 
HI < undefined 
T-1: LO < undefined 
HI < undefined 
T: LO < GPRIrs] div GPR[rt] 
HI < GPRirs] mod GPRirt] 
64 T-2: LO < undefined 
HI < undefined 
T-1: LO < undefined 
HI < undefined 
T: q GPRIrs]31..0 div GPR[rt]31..0 
r — GPR[rs]31..0 mod GPR[rt]31..0 
LO © (931)? |I Q31...0 
HI = (131) |] f31...0 
Exceptions: 
None 
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DIVU Divide Unsigned DIVU 


31 26 25 21 20 16 15 6 5 0 
SPECIAL rs rt 0 DIVU 
000000 0000000000 011011 
6 5 5 10 6 
Format: 
DIVUrs, rt (MIPS I format) 
Description: 


The contents of general-purpose register rs are divided by the contents of general- 
purpose register rt, treating both operands as unsigned integers. An Integer 
Overflow exception never occurs, and the result of this operation is undefined 
when the divisor is zero. In 64-bit mode, the result must be sign-extended, 32-bit 
values. 


This instruction is usually executed after instructions to check for a zero divisor. 


When the operation completes, the quotient (doubleword) is stored into special 
register LO and the remainder (doubleword) is stored into special register HI. 


If either of the two preceding instructions is MFHI or MFLO, the results of those 
instructions are undefined. To obtain correct results, insert two or more additional 
instructions between MFHI or MFLO and the DIVU instruction. 
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DIVU Divide Unsigned DIVU 


(continued) 


Operation: 
32 T-2: LO < undefined 
HI < undefined 
T-1: LO < undefined 
HI < undefined 
T: LO < (0 || GPR[rs]) div (0 || GPR{[rt]) 
HI < (0 || GPR[rs]) mod (0 || GPR[rt]) 
64 T-2: LO < undefined 
HI < undefined 
T-1: LO < undefined 
HI < undefined 
T: q <= (0 || GPR[rs]31...9) div (0 || GPR[rt]s1 ...0) 
r < (0 || GPR[rs]31...9) mod (0 || GPR[rt]31...0) 
LO = (q31)°* |] G31...0 
HI © (r31)*? [I r31...0 
Exceptions: 
None 
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Doubleword Move from 
D M FCO System Control Coprocessor D M FCO 


31 26 25 21 20 16 15 11 10 0 
COPO DMF rt rd 0 
010000 | 00001 00000000000 
6 5 5 5 11 
Format: 
DMEFCO rt, rd (MIPS III format) 
Description: 


The contents of coprocessor register rd of CPO are stored in general-purpose 
register rt. 


This operation is defined in 64-bit mode and 32-bit Kernel mode. Execution of this 
instruction in 32-bit User or Supervisor mode causes a Reserved Instruction 
exception. 


The contents of the source coprocessor register rd are written to the 64-bit 
destination general-purpose register rt. The operation of a DMFCO instruction on 
a 32-bit register of CPO is undefined. 


Operation: 


64 T: data <CPR[0,rd] 
T+1: GPR[rt] <— data 


Note: Same operation in 32-bit Kernel mode. 


Exceptions: 


Coprocessor Unusable exception (64-/32-bit User mode and Supervisor mode if 
CPO is disabled) 


Reserved Instruction exception (32-bit User or Supervisor mode) 
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Doubleword Move to 
D MTCO System Control Coprocessor D MTCO 


31 


26 25 21 20 16 15 0 


COPO 
010000 


DMT d 0 
00101 i : 00000000000 


6 


5 5 16 


Format: 


DMTCO rt, rd (MIPS III format) 


Description: 


The contents of general-purpose register rt are loaded into coprocessor register rd 
of CPO. 


This operation is defined in 64-bit mode or in 32-bit Kernel mode. Execution of 
this instruction in 32-bit User or Supervisor mode causes a Reserved Instruction 
exception. 


The contents of the source general-purpose register rd are written to the 64-bit 
destination coprocessor register rt. The operation of a DMTCO instruction on a 32- 
bit register of CPO is undefined. 


Because the state of the virtual address translation system may be altered by this 
instruction, the operation of Load instructions, Store instructions, and TLB 
operations for the instructions immediately before and after this instruction are 
undefined. 


Operation: 


64 


T: data — GPR[rt] 
T+1: CPR[O, rd] < data 


402 


Note: Same operation in 32-bit Kernel mode. 


Exceptions: 


Coprocessor Unusable exception (64-/32-bit User and Supervisor mode if CPO is 
disabled) 


Reserved Instruction exception (32-bit User or Supervisor mode) 
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DMULT 


Doubleword Multiply 


CPU Instruction Set 


DMULT 


Reserved Instruction exception 
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31 26 25 21 20 16 15 5 0 
SPECIAL rs rt 0 DMULT 
000000 0000000000 011100 
6 5 5 10 6 
Format: 
DMULT rs, rt (MIPS III format) 
Description: 
The contents of general-purpose registers rs and rt are multiplied, treating both 
operands as signed integers. An Integer Overflow exception never occurs. 
When the operation completes, the low-order doubleword is stored into special 
register LO and the high-order doubleword is stored into special register HI. 
If either of the two preceding instructions is MFHI or MFLO, the results of these 
instructions are undefined. To obtain correct results, insert two or more other 
instructions between MFHI or MFLO and the DMULT instruction. 
This operation is only defined in 64-bit mode and in 32-bit Kernel mode. 
Execution of this instruction in 32-bit User or Supervisor mode causes a Reserved 
Instruction exception. 
Operation: 
64 T-2: LO < undefined 
HI < undefined 
T-1: LO < undefined 
HI < undefined 
T: ot < GPR[rs] * GPR[rt] 
LO a tg3...0 
H | — t127...64 
Note: Same operation in 32-bit Kernel mode. 
Exceptions: 
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DMULTU 


Doubleword Multiply 


Unsigned 


DMULTU 


31 


26 25 


21 20 


16 15 


SPECIAL 
000000 


rs 


rt 


0 
0000000000 


DMULTU 
011101 


6 


10 


6 


Format 


Descrip 


DMULTU rs, rt 


tion: 


(MIPS III format) 


The contents of general-purpose registers rs and rt are multiplied, treating both 
operands as unsigned integers. An Overflow exception never occurs. 


When the operation completes, the low-order doubleword is stored into special 
register LO, and the high-order doubleword is stored into special register HI. 


If either of the two preceding instructions is MFHI or MFLO, the results of these 
instructions are undefined. To obtain correct results, insert two or more other 
instructions between MFHI or MFLO and the DMULTU instruction. 


This operation is defined in 64-bit mode and in 32-bit Kernel mode. Execution of 
this instruction in 32-bit User or Supervisor mode causes a Reserved Instruction 


exception. 


Operation: 


64 


T-2: 


LO < undefined 
HI < undefined 
LO < undefined 
HI < undefined 
t <— (0 || GPRirs]) * (0 || GPRirt]) 


LO < tg3__.0 
HI <ty97..64 
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Note: 


Exceptions: 


Same operation in 32-bit Kernel mode. 


Reserved Instruction exception 
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DRET 


Debug Return 


CPU Instruction Set 


DRET 


31 26 25 : 0 
SPECIAL2 0 DRET 
011100 00000000000000000000 111110 
6 20 6 
Format: 
DRET (VR5432 format) 
Description: 


The DRET instruction returns from Debug mode to the mode in effect (User, 


Supervisor, or Kernel mode) when the last debug break event has occurred. 
Control is passed to the instruction pointed to by the Debug Exception PC (DEPC) 
register. Unlike most jumps and branches, the execution of which also executes 

the next instruction (the one in the delay slot), DRET does not execute a delay slot 
instruction. The DRET instruction must not be placed in a branch delay slot. 


Operation: 


32,64 T: 


DRetOperation () 


Exceptions: 


None 
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DROR 


Doubleword Rotate Right 


DROR 


31 26 25 21 20 16 15 11 10 6 5 
SPECIAL 1 rt rd sa DROR 
000000 00001 111010 
6 5 5 5 5 6 
Format: 
DROR rd, rt, sa (VR5432 format) 


Description: 


The contents of general-purpose register rt are rotated right by sa bits, and the 
result is stored in general-purpose register rd. 


Operation: 


32, 64T: 


GPR[rd] <— GPR[rt]sa-1...0 Il GPRIrtle3...sa 


Exceptions: 


None 
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CPU Instruction Set 


DROR32 sear DROR32 


31 26 25 21 20 16 15 11 10 6 5 0 
SPECIAL 1 rt rd sa DROR32 
000000 00001 111110 
6 5 5 5 5 6 
Format: 
DROR3? rd, rt, sa (VR5432 format) 
Description: 


The contents of general-purpose register rt are rotated right by sa + 32 bits, and 
the result is stored in general-purpose register rd. 


Operation: 


32,64T: s=sa+32 
GPR[rd] — GPR[rt]s-1...0 || GPRIrt]e3._.s 


Exceptions: 


None 
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DRORV vomuereacce —- DRORV 


31 26 25 21 20 16 15 11 10 6 5 0 
SPECIAL rs rt rd 1 DRORV 
000000 00001 010110 
6 5 5 5 5 6 
Format: 
DRORYV rd, rt, rs (VR5432 format) 
Description: 


The contents of general-purpose register rf are rotated right by the number of bits 
specified by the low-order five bits of general-purpose register rs. The result is 
stored in general-purpose register rd. 


Operation: 


32,64T: s<GPRIrs]q o 
GPR[rd] <— GPR[rt]s-1__.0 || GPR[rtles...s 


Exceptions: 


None 
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CPU Instruction Set 


DSLL Doubleword Shift Left Logical DSLL 


31 26 25 21 20 16 15 11 10 6 5 0 
SPECIAL 0 rt rd sa DSLL 
000000 00000 111000 
6 5 5 5 5 6 

Format: 
DSLL rd, rt, sa (MIPS ITI format) 
Description: 


The contents of general-purpose register rt are shifted left by sa bits, inserting 
zeros into the low-order bits. The result is stored in general-purpose register rd. 


This operation is defined in 64-bit mode and in 32-bit Kernel mode. Execution of 
this instruction in 32-bit User or Supervisor mode causes a Reserved Instruction 
exception. 


Operation: 


64 T: s¢O|lsa 
GPR[rd] — GPR[rt]¢3-s)...0 || 0° 


Note: Same operation in 32-bit Kernel mode. 


Exceptions: 


Reserved Instruction exception 
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DSLLV sean DSLLV 


31 26 25 21 20 16 15 11 10 6 5 0 
SPECIAL rs rt rd 0 DSLLV 
000000 00000 010100 
6 5 5 5 5 6 

Format: 
DSLLYV rd, rt, rs (MIPS III format) 
Description: 


The contents of general-purpose register rt are shifted left by the number of bits 
specified by the low-order six bits contained in general-purpose register rs, 
inserting zeros into the low-order bits. The result is stored in general-purpose 
register rd. 


This operation is defined in 64-bit mode and 32-bit Kernel mode. Execution of this 
instruction in 32-bit User or Supervisor mode causes a Reserved Instruction 
exception. 


Operation: 


64 T: s <— GPRIrs]5..0 
GPR[rd]<— GPR[rt](63-s)...0 || 0° 


Note: Same operation in 32-bit Kernel mode. 


Exceptions: 


Reserved Instruction exception 
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DSLL32 


Doubleword Shift Left 
Logical Plus 32 


CPU Instruction Set 


DSLL32 


31 26 25 21 20 16 15 11 10 6 5 0 
SPECIAL 0 rt rd sa DSLL32 
000000 00000 111100 
6 5 5 5 5 6 

Format: 


DSLL32 rd, rt, sa (MIPS ITI format) 


Description: 


The contents of general-purpose register rt are shifted left by 32 + sa bits, inserting 
zeros into the low-order bits. The result is stored in general-purpose register rd. 


This operation is defined in 64-bit mode and in 32-bit Kernel mode. Execution of 
this instruction in 32-bit User or Supervisor mode causes a Reserved Instruction 


exception. 
Operation: 
64 T: s<1||sa 
GPR[rd]<— GPR[rt](63-5)...0 || 0s 
Note: Same operation in 32-bit Kernel mode. 
Exceptions: 


Reserved Instruction exception 
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Doubl d 
DS RA Shift picks aausicde DS RA 


31 26 25 21 20 16 15 11 10 6 5 0 
SPECIAL 0 rt rd sa DSRA 
000000 00000 111011 
6 5 5 5 5 6 

Format: 
DSRA rd, rt, sa (MIPS III format) 
Description: 


The contents of general-purpose register rt are shifted right by sa bits, sign 
extending the high-order bits. The result is stored in general-purpose register rd. 


This operation is defined in 64-bit mode and in 32-bit Kernel mode. Execution of 
this instruction in 32-bit User or Supervisor mode causes a Reserved Instruction 
exception. 


Operation: 


64 T: s<0||sa 
GPR[rd] <— (GPR[rtlga)* || GPRI[rt] 63...s 


Note: Same operation in 32-bit Kernel mode. 


Exceptions: 


Reserved Instruction exception 
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Doubleword Shift Right 
DS RAV Arithmetic Variable DS RAV 


31 26 25 21 20 16 15 11 10 6 5 0 
SPECIAL rs rt rd 0 DSRAV 
000000 00000 010111 
6 5 5 5 5 6 
Format: 
DSRAYV rd, rt, rs (MIPS ITI format) 
Description: 


The contents of general-purpose register rt are shifted right by the number of bits 
specified by the low-order six bits of general-purpose register rs, sign extending 
the high-order bits. The result is stored in general-purpose register rd. 


This operation is defined in 64-bit mode and in 32-bit Kernel mode. Execution of 
this instruction in 32-bit User or Supervisor mode causes a Reserved Instruction 
exception. 


Operation: 


64 T: s«GPRirs]s5 0 
GPR[rd] — (GPRI[rtleg)° || GPR[rtHea...s 


Note: Same operation in 32-bit Kernel mode. 


Exceptions: 


Reserved Instruction exception 


Vr5432 Microprocessor User’s Manual 413 


Chapter 17 


DSRA32 ie enero DSRA32 


31 26 25 21 20 16 15 11 10 6 5 0 
SPECIAL 0 rt rd sa DSRA32 
000000 00000 111111 
6 5 5 5 5 6 
Format: 
DSRA372 rd, rt, sa (MIPS ITI format) 
Description: 


The contents of general-purpose register rt are shifted right by 32 + sa bits, sign 
extending the high-order bits. The result is stored in general-purpose register rd. 


This operation is defined in 64-bit mode and 32-bit Kernel mode. Execution of this 
instruction in 32-bit User or Supervisor mode causes a Reserved Instruction 
exception. 


Operation: 


64 T: s <1 ||sa 
GPR[rd] <— (GPR[rt}g3)° || GPRIrt] 63...s 


Note: Same operation in 32-bit Kernel mode. 


Exceptions: 


Reserved Instruction exception 
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Doubleword 
DSRL Shift Right Logical DSRL 
31 26 25 21 20 16 15 11 10 6 5 0) 
SPECIAL 0 rt rd sa DSRL 
000000 00000 111010 
6 5 5 5 5 6 


Format: 
DSRL rd, rt, sa 


Description: 


(MIPS III format) 


The contents of general-purpose register rt are shifted right by sa bits, inserting 
zeros into the high-order bits. The result is stored in general-purpose register rd. 


This operation is defined in 64-bit mode and in 32-bit Kernel mode. Execution of 
this instruction in 32-bit User or Supervisor mode causes a Reserved Instruction 


exception. 
Operation: 
64 T: s<0||sa 


GPR[rd] < 0° || GPR[rt]gg... 


Note: Same operation in 32-bit Kernel mode. 


Exceptions: 


Reserved Instruction exception 
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DSRLV Logical Varieties DSRLV 


31 26 25 21 20 16 15 11 10 6 5 0 
SPECIAL rs rt rd 0 DSRLV 
000000 00000 010110 
6 5 5 5 5 6 

Format: 
DSRLYV rd, rt, rs (MIPS III format) 
Description: 


The contents of general-purpose register rt are shifted right by the number of bits 
specified by the low-order six bits of general-purpose register rs, inserting zeros 
into the high-order bits. The result is stored in general-purpose register rd. 


This operation is defined in 64-bit mode and 32-bit Kernel mode. Execution of this 
instruction in 32-bit User or Supervisor mode causes a Reserved Instruction 
exception. 


Operation: 


64 T: s<GPRIrs]s5. 0 
GPR[rd] < 0° || GPRIrtle3...5 


Note: Same operation in 32-bit Kernel mode. 


Exceptions: 


Reserved Instruction exception 
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CPU Instruction Set 


DSRL32 ures DSRL32 


31 26 25 21 20 16 15 11 10 6 5 0 
SPECIAL 0 rt rd sa DSRL32 
000000 00000 111110 
6 5 5 5 5 6 
Format: 


DSRL32 rd, rt, sa 


Description: 


The contents of general-purpose register rt are shifted right by 32 + sa bits, 
inserting zeros into the high-order bits. The result is stored in general-purpose 
register rd. 


This operation is defined in 64-bit mode and 32-bit Kernel mode. Execution of this 
instruction in 32-bit User or Supervisor mode causes a Reserved Instruction 
exception. 


Operation: 


64 T: s<1||sa 
GPR[rd] < 08 || GPR[rt]g3._s 


Note: Same operation in 32-bit Kernel mode. 


Exceptions: 


Reserved Instruction exception 
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D S U B Doubleword Subtract D S U B 


31 26 25 21 20 16 15 11 10 6 5 0 
SPECIAL rs rt rd 0 DSUB 
000000 00000 101110 
6 5 5 5 5 6 

Format: 
DSUB rd, rs, rt (MIPS III format) 
Description: 


The contents of general-purpose register rt are subtracted from the contents of 
general-purpose register rs, and the result is stored in general-purpose register rd. 


An Integer Overflow exception takes place if the carries-out of bits 62 and 63 
differ (a two’s-complement overflow). The contents of destination register rd are 
not modified when an Integer Overflow exception occurs. 


This operation is defined in 64-bit mode and 32-bit Kernel mode. Execution of this 
instruction in 32-bit User or Supervisor mode causes a Reserved Instruction 


exception. 
Operation: 
64 T: GPR[rd] <— GPRirs] — GPR[rt] 
Note: Same operation in 32-bit Kernel mode. 
Exceptions: 


Integer Overflow exception 
Reserved Instruction exception 
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DS U B U Doubleword Subtract Unsigned DS U B U 


31 26 25 21 20 16 15 11 10 6 5 0 
SPECIAL rs rt rd 0 DSUBU 
000000 00000 101111 

6 5 5 5 5 6 
Format: 
DSUBU rd, rs, rt (MIPS III format) 
Description: 


The contents of general-purpose register rt are subtracted from the contents of 
general-purpose register rs, and the result is stored in general-purpose register rd. 


The only difference between this instruction and the DSUB instruction is that the 
DSUBU instruction never causes an Integer Overflow exception. 


This operation is defined in 64-bit mode and 32-bit Kernel mode. Execution of this 
instruction in 32-bit User or Supervisor mode causes a Reserved Instruction 


exception. 
Operation: 
64 T: GPR[rd] <— GPR[rs] — GPR[rt] 
Note: Same operation in 32-bit Kernel mode. 
Exceptions: 


Reserved Instruction exception 
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E R ET Return from Exception Ee R ET 


31 26 2524 65 0 
COPO oe) 0 ERET 
010000 1 0000000000000000000 011000 
6 1 19 6 
Format: 
ERET (MIPS III format) 
Description: 


ERET is for returning from an interrupt, exception, or error exception. Unlike a 
Branch or Jump instruction, ERET does not execute a delay slot instruction. 


The ERET instruction must not itself be placed in a branch delay slot. 


If the ERL bit of the Status register is set (SR, = 1), load the contents of the 
ErrorEPC register to the PC and clear the ERL bit to zero. Otherwise (SR, = 0), 
load the PC from the EPC, and clear the EXL bit of the Status register to zero 
(SR; =0). 


An ERET instruction executed between an LL instruction and an SC instruction 
causes the SC instruction to fail, since the ERET instruction clears the LL bit to 
Zero. 


Operation: 


32, 64 T: if SR» =1 then 

PC < ErrorEPC 

SR < SR31__3 || 9 || SRy_.0 
else 

PC ¢« EPC 

SR <— SR3}1_..2 || 0 || SRo 
endif 
LLbit — 0 


Exceptions: 


Coprocessor Unusable exception 
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CPU Instruction Set 


J Jump J 


31 26 25 0 
J target 
000010 
6 26 
Format: 
J target (MIPS I format) 
Description: 


The 26-bit target is shifted left two bits and combined with the high-order four bits 
of the address of the delay slot to calculate the target address. The program 
unconditionally jumps to this calculated address with a delay of one instruction. 


Because instructions must be word aligned, a J instruction must specify an address 
where the low-order two bits are zero. If these low-order two bits are not zero, an 
Address Error exception will occur when the Jump target instruction is fetched. 


Operation: 
32 T: temp < target 

T+1: PC — PCs; _ 99 || temp || 07 
64 T: temp < target 

T+1: PC © PCg3_9¢ || temp || 07 
Exceptions: 


Address Error exception 
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JAL Jump and Link 


JAL 


31 26 25 0 
JAL target 
000011 
6 26 
Format: 
JAL target (MIPS I format) 
Description: 


The 26-bit target is shifted left two bits and combined with the high-order four bits 
of the address of the delay slot to calculate the address. The program 
unconditionally jumps to this calculated address with a delay of one instruction. 
The address of the instruction after the delay slot is placed in the link register, r3/. 


Because instructions must be word aligned, a JAL instruction must specify an 


address where the low-order two bits are zero. If these low-order two bits are not 


zero, an Address Error exception will occur when the Jump target instruction is 


fetched. 
Operation: 


32 T: temp < target 
GPR[31] — PC +8 
T+1: PC — PC 34. 98 || temp || 07 


64 T: temp < target 
GPR[31] < PC +8 
T+1: PC — PC 69. 98 || temp || 07 


Exceptions: 


Address Error exception 
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JALR 


CPU Instruction Set 


Jump and Link Register JAL R 


31 26 25 21 20 16 15 11 10 6 5 0 
SPECIAL rs 0 rd 0 JALR 
000000 00000 00000 001001 
6 5 5 5 5 6 

Format: 
JALR rs (MIPS I format, rd = 31 implied) 
JALR rd, rs (MIPS I format) 

Description: 


The program unconditionally jumps to the address contained in general-purpose 
register rs, with a delay of one instruction. The address of the instruction after the 
delay slot is stored in general-purpose register rd. The default value of rd, if 
omitted in the assembly language instruction, is 31. 


Register numbers rs and rd should not be equal, because such an instruction does 
not have the same effect when re-executed. If they are equal, the contents of rs are 
destroyed by storing a link address. However, if an attempt is made to execute this 
instruction, an exception will not occur, and the result of executing such an 
instruction is undefined. 


Because instructions must be word aligned, a JALR instruction must specify a 
target register (rs) that contains an address where the low-order two bits are zero. 
If these low-order two bits are not zero, an Address Error exception will occur 
when the Jump target instruction is fetched. 


Operation: 
32, 64 T: temp < GPR [rs] 
GPR[rd] — PC + 8 
T+1: PC < temp 
Exceptions: 


Address Error exception 
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J R Jump Register J R 


31 26 25 21 20 65 0 
SPECIAL rs 0 JR 
000000 000000000000000 001000 
6 5 15 6 
Format: 
JR rs (MIPS I format) 
Description: 


The program unconditionally jumps to the address contained in general-purpose 
register rs, with a delay of one instruction. 


Because instructions must be word aligned, a JR instruction must specify a target 
register (rs) that contains an address where the low-order two bits are zero. If these 
low-order two bits are not zero, an Address Error exception will occur when the 

Jump target instruction is fetched. 


Operation: 

32, 64 T: temp < GPR[rs] 
T+1: PC < temp 

Exceptions: 


Address Error exception 
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CPU Instruction Set 


L B Load Byte L B 


31 26 25 21 20 16 15 0 
LB base rt offset 
100000 
6 5 5 16 
Format: 
LB rt, offset (base) (MIPS I format) 
Description: 


The 16-bit offset is sign extended and added to the contents of general-purpose 
register base to form a virtual address. The contents of the byte at the memory 
location specified by the address are sign extended and loaded into general- 
purpose register rt. 


Operation: 


32. T:  vAddr < ((offsetys)'® || offsetys 0) + GPR[base] 
(pAddr, uncached) < AddressTranslation (vAddr, DATA) 
pAddr <— pAddrpsize — 1 ... 3 || (PAddro. 9 xor ReverseEndian’ ) 
mem < LoadMemory (uncached, BYTE, pAddr, vAddr, DATA) 
byte < vAddro_ xor BigEndianCPU® 
GPRIrt] <— (mem7,8pyte)~* || MeM7,8*vyte...8*byte 

64 TT:  vAddr < ((offset;s)*? || offsetys 0) + GPR[ base] 
(pAddr, uncached) « AddressTranslation (vAddr, DATA) 
pAddr <— pAddrpgiz_ — 1 ... 3 || (PAddro. 9 xor ReverseEndian’®) 
mem < LoadMemory (uncached, BYTE, pAddr, vAddr, DATA) 
byte < vAddrs 9 xor BigEndianCPU® 
GPRIrt] — (mem7,8+pyte)”> || MeM7,8*vyte...8*byte 


Exceptions: 


TLB Miss exception 
TLB Invalid exception 
Bus Error exception 
Address Error exception 
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LBU 


Load Byte Unsigned L B U 


31 26 25 21 20 16 15 0 
LBU base rt offset 
100100 
6 5 5 16 
Format: 
LBU tt, offset(base) (MIPS I format) 
Description: 


The 16-bit offset is sign extended and added to the contents of general-purpose 
register base to form a virtual address. The contents of the byte at the memory 
location specified by the address are zero extended and loaded into general- 


purpose register rt. 


Operation: 


32 qe 


64 T: 


vAddr < ((offsetys) '® || offsetys__9) + GPR[base] 

(pAddr, uncached) < AddressTranslation (vAddr, DATA) 
pAddr <— pAddrpsize — 4 ... 3 || (Op Addrea_ 9 xor ReverseEndian’) 
mem < LoadMemory (uncached, BYTE, pAddr, vAddr, DATA) 
byte < vAddro _ xor BigEndianCPU? 

GPR[rt] — 0° || mem7,¢* byte...8* byte 

vAddr < ((offset;s)*° || offsetys 9) + GPR[base] 

(pAddr, uncached) < AddressTranslation (vAddr, DATA) 
pAddr <— pAddrpgize — 1...3 || (PAddro. 9 xor ReverseEndian’) 
mem < LoadMemory (uncached, BYTE, pAddr, vAddr, DATA) 
byte < vAddro_ 9 xor BigEndianCPU® 

GPRIrt] < 0°° || mem7.8* byte...8* byte 


Exceptions: 


TLB Miss exception TLB Invalid exception 
Bus Error exception Address Error exception 
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CPU Instruction Set 


L D Load Doubleword L D 


31 26 25 21 20 16 15 0 
LD base rt offset 
110111 
6 5 5 16 
Format: 
LD rt, offset (base) (MIPS III format) 
Description: 


The 16-bit offset is sign extended and added to the contents of general-purpose 
register base to form a virtual address. The contents of the 64-bit doubleword at 
the memory location specified by the address are loaded into general-purpose 
register rt. 


If any of the low-order three bits of the address are not zero, an Address Error 
exception occurs. 


This operation is defined in 64-bit mode and 32-bit Kernel mode. Execution of this 
instruction in 32-bit User or Supervisor mode causes a Reserved Instruction 
exception. 


Operation: 


64 T:  vAddr< ((offsetys)*® || offsetys 0) + GPR[base] 
(pAddr, uncached) < AddressTranslation (vAddr, DATA) 
mem < LoadMemory (uncached, DOUBLEWORD, pAddr, vAddr, DATA) 
GPR[rt] — mem 


Note: In 32-bit Kernel mode, the high-order 32 bits are ignored during 
virtual address creation. 


Exceptions: 


TLB Miss exception 

TLB Invalid exception 

Bus Error exception 

Address Error exception 
Reserved Instruction exception 
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L D Cz Load Doubleword to Coprocessor z L D Cz 
31 26 25 21 20 16 15 0 

LDCz base rt offset 

1101xx 

6 5 5 16 
Format: 
LDCz rt, offset (base) (MIPS II format) 

Description: 


428 


The 16-bit offset is sign extended and added to the contents of general-purpose 
register base to form a virtual address. The processor loads a doubleword from the 
addressed memory location to CPz. The manner in which each coprocessor uses 
the data is defined by the individual coprocessor specifications. 


If any of the low-order three bits of the address are not zero, an Address Error 
exception takes place. 


This instruction is not valid for use with CPO. 


When CP1 is specified, the FR bit of the Status register equals zero and the least- 
significant bit in the rt field is not zero; the operation of the instruction is 
undefined. If the FR bit equals one, an odd or even register is specified by rt. 
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L D Cc Z Load Doubleword to Coprocessor z L D Cc Z 


(continued) 


Operation: 


32 TT:  vAddr < ((offset;s)'® || offsetys 0) + GPR[base] 
(pAddr, uncached) < AddressTranslation (vAddr, DATA) 
mem < LoadMemory (uncached, DOUBLEWORD, pAddr, vAddr, DATA) 


COPzLD (rt, mem) 


64 T:  vAddr < ((offsetys)*® || offsetys 0) + GPR[base] 
(pAddr, uncached) < AddressTranslation (vAddr, DATA) 
mem < LoadMemory (uncached, DOUBLEWORD, pAddr, vAddr, DATA) 


COP2zLD (rt, mem) 


Exceptions: 


TLB Miss exception 

TLB Invalid exception 

Bus Error exception 

Address Error exception 
Coprocessor Unusable exception 


Opcode Bit Encoding: 
LDCz Bit #31 30 29 28 27 26 0 
LDC1 
Bit #31 30 29 28 27 26 0 
LDC2 
Opcode Coprocessor Number 
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LDL 


Load Doubleword Left L D L 


31 26 25 21 20 1615 0 
LDL base rt offset 
011010 
6 5 5 16 
Format: 
LDL rt, offset (base) (MIPS ITI format) 
Description: 


This instruction is used in combination with the LDR instruction to load the 
doubleword data in the memory that is not at the word boundary to general- 
purpose register rt. The LDL instruction loads the higher portion of the data to the 
register, while the LDR instruction loads the lower portion. 


The 16-bit offset is sign extended and added to the contents of general-purpose 
register base to generate a virtual address that can specify any byte. Of the 
doubleword data in the memory where the most-significant byte is specified by the 
generated address, only the data at the same word boundary as the target address 
is loaded and stored to the higher portion of general-purpose register rt. The 
remaining portion of the register is not affected. Depending on the address 
specified, the number of bytes to be loaded changes from | to 8. 


In other words, first the addressed byte is stored to the most-significant byte 
position of general-purpose register rt. If there is data of the low-order byte that 
follows the same doubleword boundary, the operation to store this data to the next 
byte of general-purpose register rt is repeated. The remaining low-order byte is 
not affected. 


Memory 
(Big Endian) Register 
address 8 8 | 9| 10) 11} 12/13] 14/15) Before 
address0 |0/1/213/4/51| 6] 71 loading A| BIC} P| E|F) GH) $24 
LDL $24,3($0) 
After 
loading |° 4'5/6|7|)F) GlH} $24 
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(continued) 


The contents of general-purpose register rt are internally bypassed within the 
processor, so that no NOP instruction is needed between an immediately 
preceding Load instruction that targets general-purpose register rt and a 
subsequent LDL (or LDR) instruction. 


The Address Error exception does not occur even if the specified address is not at 
the doubleword boundary. 


This operation is defined in 64-bit mode and 32-bit Kernel mode. Execution of this 
instruction in 32-bit User or Supervisor mode causes a Reserved Instruction 
exception. 


Operation: 


64 


T: 


vAddr < ((offset;s)** || offsety5 9) + GPR[base] 
(pAddr, uncached) < AddressTranslation (vAddr, DATA) 
pAddr <— pAddrpsjze-1...3 || (PAddrs. 9 xor ReverseEndian®) 
if BigEndianMem = 0 then 

pAddr < pAddrpsize-1...3 || 0° 
endif 
byte < vAddrs_9 xor BigEndianCPU? 


mem < LoadMemory (uncached, byte, pAddr, vAddr, DATA) 
GPR[rt] — mem7,¢*pyte...0 || GPR[tt]s55_s*byte...0 


Note: In 32-bit Kernel mode, the high-order 32 bits are ignored during 
virtual address creation. 
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LDL 


Load Doubleword Left 
(continued) L D L 


The relationship between the address given to the LDL instruction and the result 
(bytes for registers) is shown below: 


LDL 
Register A B Cc D E F G H 
Memory | J K L M N O P 
BigEndianCPU = 0 BigEndianCPU = 1 
Offset Offset 
vAddrs__o Destination Type | LEM|BEM Destination Type} LEM BEM 
0 PBCDEFGH| 0 0/7 || J KLMNOP| 7 0 0 
1 OPCDEFGH| 1 0/6 |J KLMNOP H| 6 0 1 
2 NOPDEFGH} 2 0/5 |KLMNOPGH| 5 0 | 2 
3 MNOPEFGH} 383 0/4 |LMNOPFGH| 4 0| 3 
4 L MNOPFGH| 4 0/3 |MNOPEFGH| 3 0| 4 
5 KLMNOPGH} 5 0/2 |NOPDEFGH| 2 0 5 
6 J KLMNOPH| 6 0 | 1 OPCDEFGH| 1 0 6 
7 | J KLMNOP| 7 0 | 0 PBCDEFGH 0 0 7 
Note: Type: Access type output to memory (refer to Table 16-3 on 
page 324 for information on byte access within a double- 
word) 
Offset: | pAddr, ¢ Output to memory 
LEM Little-endian memory (BigEndianMem = 0) 
BEM Big-endian memory (BigEndianMem = 1) 
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Exceptions: 


TLB Miss exception 

TLB Invalid exception 

Bus Error exception 

Address Error exception 
Reserved Instruction exception 
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LDR 


Load Doubleword Right L D R 


31 


26 25 21 20 16 15 0 


LDR 


011011 


base rt offset 


6 


Format: 


LDR rt, offset (base) (MIPS ITI format) 


Description: 


This instruction is combined with the LDL instruction to load the word data in the 
memory that is not at the word boundary to general-purpose register rt. The LDL 
instruction loads the higher portion of the data to the register, while the LDR 
instruction loads the lower portion. 


The 16-bit offset is sign extended and added to the contents of general-purpose 
register base to generate a virtual address that can specify any byte. Of the word 
data in memory where the least-significant byte is specified by the generated 
address, only the data at the same doubleword boundary as the target address is 
loaded and stored to the lower portion of general-purpose register rt. The 
remaining portion of the register is not affected. Depending on the address 
specified, the number of bytes to be loaded changes from | to 8. 


In other words, first the addressed byte is stored to the least-significant byte 
position of general-purpose register rt. If there is data of the high-order byte that 
follows the same doubleword boundary, the operation to store this data to the next 
byte of general-purpose register rt is repeated. The remaining high-order byte is 
not affected. 


Memory 


(Big Endian) Register 


address 8 


9 {10 {11 |12 |13 | 14 |15 Betare 


address 0 


(=) 


17/2/3);4/);5/6|7 loading | A| B} C} D| E| F| G| H| $24 


LDR $24,4($0) 


After 
loading 


A|B/C/0)1/2/3)4 | $24 
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(continued) 


The contents of general-purpose register rt are bypassed within the processor so 
that no NOP instruction is needed between an immediately preceding Load 
instruction that targets general-purpose register rt and a subsequent LDR (or LDL) 
instruction. 


The Address Error exception does not occur even if the specified address is not 
located at the doubleword boundary. 


This operation is defined in 64-bit mode and 32-bit Kernel mode. Execution of this 
instruction in 32-bit User or Supervisor mode causes a Reserved Instruction 
exception. 


Operation: 


64 T: vAddr < ((offset;s)** || offset,s 0) + GPR[base] 
(pAddr, uncached) < AddressTranslation (vAddr, DATA) 
pAddr <— pAddrpsjze-1...3 || (PAddro. 9 xor ReverseEndian’) 
if BigEndianMem = 1 then 
pAddr < pAddr3;_ 3 || 0° 
endif 
byte < vAddro_9 xor BigEndianCPU® 
mem < LoadMemory (uncached, DOUBLEWORD - byte, pAddr, vAddr, DATA) 
GPR[rt] <— GPRIrtles...64-8*byte || MeMgs...s*byte 


Note: In 32-bit Kernel mode, the high-order 32 bits are ignored during 
virtual address creation. 
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LDR 


Load Doubleword Right 


(continued) L D R 


The relationship between the address given to the LDR instruction and the result 
(bytes for registers) is shown below: 


LDR 
Register A B C D E F G H 
Memory | J K L M N O P 
BigEndianCPU = 0 BigEndianCPU = 1 
Offset Offset 
vAddro 9 Destination Type |LEM|BEM Destination Type| LEM|BEM 
0 | J KLMNOP)| 7 0/0 |ABCDEFGI 0 7 | 0 
1 Al JKLMNO)> 6 1/0 |ABCDEFI J} 14 6 0 
2 ABI JKLMN) 5 2;0 |ABCDEI J K} 2 5 0 
3 ABCIJKLM| 4 3/0 |ABCDIJKL} 3 4 | 0 
4 ABCDI JKL) 83 4;0 |ABCIJKLM 4 3 0 
5 ABCDEIJK) 2 5/0 |ABIJKLMN| 5 2 0 
6 ABCDEFI J 1 6|0 ;Al JKLMNO} 6 1 0 
vf ABCDEF GI 0 7 | 0 1 J KLMNOP| 7 0 0 
Note: Type: Access type output to memory (refer to Table 16-3 on 
page 324 for information on byte access within a double- 
word) 
Offset: pAddr» _ , Output to memory 
LEM Little-endian memory (BigEndianMem = 0) 
BEM Big-endian memory (BigEndianMem = 1) 
436 Vr5432 Microprocessor User’s Manual 


CPU Instruction Set 
Load Doubleword Right 
LDR (continued) LDR 


Exceptions: 


TLB Miss exception 

TLB Invalid exception 

Bus Error exception 

Address Error exception 
Reserved Instruction exception 
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LH 


Load Halfword L H 


31 26 25 21 20 16 15 0 
LH base rt offset 
100001 
6 5 5 16 
Format: 
LH rt, offset (base) (MIPS I format) 
Description: 


The 16-bit offset is sign extended and added to the contents of general-purpose 
register base to form a virtual address. The contents of the halfword at the memory 
location specified by the address are sign extended and loaded into general- 
purpose register rt. 


If the least-significant bit of the address is not zero, an Address Error exception 
occurs. 


Operation: 


32 T: 


64 T: 


vAddr < ((offset;s)'® || offset;s_09) + GPR[base] 

(pAddr, uncached) < AddressTranslation (vAddr, DATA) 

pAddr <— pAddrpsize — 1...3 || (PAddro. 9 xor (ReverseEndian? || 0)) 
mem < LoadMemory (uncached, HALFWORD, pAddr, vAddr, DATA) 
byte <— vAddro_ xor (BigEndianCPU? || 0) 

GPRIrt] <— (mem15.¢%byte) ° || Mem15,8*yte...8* byte 

vAddr < ((offsetys)*® || offsetis. 9) + GPR[base] 

(pAddr, uncached) < AddressTranslation (vAddr, DATA) 

pAddr <— pAddrpgize — 1...3 || (PAAdre_ 9 xor (ReverseEndian? || 0)) 
mem < LoadMemory (uncached, HALFWORD, pAddr, vAddr, DATA) 
byte <— vAddro 9 xor (BigEndianCP ={i:0) 

GPRi[rt] — (mem45,8*byte) '° || MeM15,8*byte...8* byte 


Exceptions: 
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TLB Miss exception 
TLB Invalid exception 
Bus Error exception 
Address Error exception 
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LH U Load Halfword Unsigned LH U 


31 26 25 21 20 16 15 0 
LHU base rt offset 
100101 
6 5 5 16 
Format: 
LHU rt, offset (base) (MIPS I format) 
Description: 


The 16-bit offset is sign extended and added to the contents of general-purpose 
register base to form a virtual address. The contents of the halfword at the memory 
location specified by the address are zero extended and loaded into general- 
purpose register 7t. 


If the least-significant bit of the address is not zero, an Address Error exception 
occurs. 


Operation: 


32 TT:  vAddr< ((offset;s)'® || offsetys 9) + GPR[base] 
(pAddr, uncached) < AddressTranslation (vAddr, DATA) 
pAddr <— pAddrpgize — 1...3 || (PAddro. 9 xor (ReverseEndian? || 0)) 
mem < LoadMemory (uncached, HALFWORD, pAddr, vAddr, DATA) 
byte < vAddro_ xor (BigEndianCPU? || 0) 
GPRirt] — 0'° || mem15,s*byte...*byte 


64 T:  vAddr < ((offset,s)*® || offset;s 0) + GPR[base] 
(pAddr, uncached) < AddressTranslation (vAddr, DATA) 
pAddr <— pAddrpgize — 1...3 || (PAddrs_ 9 xor (ReverseEndian? || 0)) 
mem < LoadMemory (uncached, HALFWORD, pAddr, vAddr, DATA) 
byte < vAddro_  xor (BigEndianCPU? || 0) 
GPRIrt] — 0%° || mem15,s*byte..s*byte 


Exceptions: 


TLB Miss exception TLB Invalid exception 
Bus Error exception Address Error exception 
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LL 


Load Linked LL 


31 26 25 21 20 16 15 0 
LL base rt offset 
110000 
6 5 5 16 
Format: 
LL rt, offset (base) (MIPS II format) 
Description: 


440 


The 16-bit offset is sign extended and added to the contents of general-purpose 
register base to form a virtual address. The contents of the word at the memory 
location specified by the address are loaded into general-purpose register rt. In 64- 
bit mode, the loaded word is sign extended. In addition, the specified physical 
address of the memory is stored in the LLAddr register, and sets the LL bit to 1. 
Afterward, the processor checks whether the address stored in the LLAddr register 
has been rewritten by the other processors or devices. 


This instruction is provided for compatibility with MIPS implementations that 
implement multiprocessing facilities; however, the VR5432 does not implement 
these facilities. 


Load Linked (LL) and Store Conditional (SC) instructions can be used to update 
memory atomically: 


L1: 
LL T1, (T0) 
ADD T2, 71,1 
SC T2, (T0) 
BEQ T2, 0, L1 


This atomically increments the word addressed by TO. Changing the ADD 
instruction to an OR instruction changes this to an atomic bit set. 


This instruction is available in User mode; it is not necessary to enable CPO. 
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LL Load Linked LL 


(continued) 


If the specified address is in the noncache area, the operation of the LL instruction 
is undefined. A cache miss that occurs between the LL and SC instructions hinders 
execution of the SC instruction. Usually, therefore, one should not use a Load or 
Store instruction between the LL and SC instructions. Otherwise, the operation of 
the SC instruction is not guaranteed. If an exception frequently occurs, the 
exception also hinders execution of the SC instruction. It is therefore necessary to 
disable the exception temporarily. 


If either of the low-order two bits of the address is not zero, an Address Error 
exception takes place. 


Operation: 


32 TT:  vAddr < ((offset;s)'® || offsety5 0) + GPR[base] 
(pAddr, uncached) < AddressTranslation (vAddr, DATA) 
pAddr <— pAddrpgize_-1...3 || (PAddre. 9 xor (ReverseEndian || 07)) 
mem < LoadMemory (uncached, WORD, pAddr, vAddr, DATA) 
byte <— vAddrs_ xor (BigEndianCPU || 0°) 
GPR[rt] — mems31 + 8*byte...8*byte 
LLbit < 1 
LLAddr «+ pAddr 


64 T:  vAddr < ((offset;s)*® || offsetys._9) + GPR[base] 
(pAddr, uncached) < AddressTranslation (vAddr, DATA) 
pAddr < pAddrpgiz_e-1...3 || (PAddre_ 9 xor (ReverseEndian || 07)) 
mem < LoadMemory (uncached, WORD, pAddr, vAddr, DATA) 
byte < vAddro_g xor (BigEndianCPU || 07) 
GPRi[rt] — (mems1,g*byte)”~ || MEM31.8*byte...8*byte 
LLbit < 1 
LLAddr < pAddr 


Exceptions: 


TLB Miss exception 
TLB Invalid exception 
Bus Error exception 
Address Error exception 
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LLD Load Linked Doubleword LLD 
31 26 25 21 20 16 15 0 

LLD base rt offset 

110100 

6 5 5 16 
Format: 
LLD rt, offset (base) (MIPS ITI format) 

Description: 
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The 16-bit offset is sign extended and added to the contents of general-purpose 
register base to form a virtual address. The contents of the doubleword at the 
memory location specified by the address are loaded into general-purpose register 
rt. In addition, the specified physical address of the memory is stored in the 
LLAddr register, and sets the LL bit to 1. Afterward, the processor checks whether 
the address stored in the LLAddr register has been rewritten by the other 
processors or devices. 


This instruction is provided for compatibility with MIPS implementations that 
implement multiprocessing facilities; the VR5432 does not implement these 
facilities. 


The Load Linked Doubleword (LLD) instruction and the Store Conditional 
Doubleword (SCD) instruction can be used to update the memory atomically: 


L1: 
LL T1, (TO) 
DADD- T2,T1, 1 
SCD T2, (TO) 
BEQ T2, 0, L1 
NOP 


This atomically increments the doubleword addressed by TO. Changing the 
DADD instruction to an OR instruction changes this to an atomic bit set. 
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(continued) 


If the specified address is in a noncache area, the operation of the LLD instruction 
is undefined. If a data cache miss occurs between the LLD and SCD instructions, 
the operation of the SCD instruction is not guaranteed. Therefore, do not use a 
Load or Store instruction between the LLD and SCD instructions. An exception 
also causes the operation of the SCD instruction to not be guaranteed, so it is 
necessary to disable exceptions temporarily. 


This operation is defined in 64-bit mode and 32-bit Kernel mode. Execution of this 
instruction in 32-bit User or Supervisor mode causes a Reserved Instruction 
exception. 


Operation: 


32 T:  vAddr < ((offset;s)'® || offsetys9) + GPR[base] 
(pAddr, uncached) < AddressTranslation (vAddr, DATA) 
mem < LoadMemory (uncached, DOUBLEWORD, pAddr, vAddr, DATA) 
GPR[rt] — mem 
LLbit < 1 
LLAddr < pAddr 


64 T:  vAddr < ((offset;s)*® || offsetys 0) + GPR[base] 
(pAddr, uncached) < AddressTranslation (vAddr, DATA) 
mem < LoadMemory (uncached, DOUBLEWORD, pAddr, vAddr, DATA) 
GPR[rt] — mem 
LLbit <— 1 
LLAddr < pAddr 


Note: In 32-bit Kernel mode, the high-order 32 bits are ignored during 
virtual address creation. 
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LLD Load Linked Doubleword LLD 


(continued) 


Exceptions: 


TLB Miss exception 

TLB Invalid exception 

Bus Error exception 

Address Error exception 
Reserved Instruction exception 
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L U | Load Upper Immediate L U | 
31 26 25 21 20 16 15 0 
LUI 0 rt immediate 
001111 00000 
6 5 5 16 
Format: 
LUI rt, immediate (MIPS I format) 
Description: 


The 16-bit immediate is shifted left 16 bits and extended on the right with 16 bits 
of zeros. The result is placed into general-purpose register rt. In 64-bit mode, the 


32-bit result is sign extended to 64 bits. 


Operation: 


32 7:  GPR{[rt] < immediate || 0'° 


64. T:  GPR{rt] < (immediate;s)% || immediate || 0'° 


Exceptions: 


None 
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LW 


Load Word LW 


31 26 25 21 20 16 15 0 
LW base rt offset 
100011 
6 5 5 16 
Format: 


LW tt, offset (base) 


Description: 


(MIPS I format) 


The 16-bit offset is sign extended and added to the contents of general-purpose 
register base to form a virtual address. The contents of the word at the memory 
location specified by the address are loaded into general-purpose register rt. In 64- 
bit mode, the loaded word is sign extended to 64 bits. 


If either of the low-order two bits of the address is not zero, an Address Error 


exception occurs. 


Operation: 


32 T:  vAddr< ((offset;s)!® || offsetys 0) + GPR[base] 
(pAddr, uncached) < AddressTranslation (vAddr, DATA) 
mem < LoadMemory (uncached, WORD, pAddr, vAddr, DATA) 
GPR[rt] — mem 


64 T:  vAddr< ((offsetys)*® || offsetys 0) + GPR[base] 
(pAddr, uncached) < AddressTranslation (vAddr, DATA) 
mem < LoadMemory (uncached, WORD, pAddr, vAddr, DATA) 
GPR[rt] — mem 


Exceptions: 


TLB Miss exception 


TLB Invalid exception 


Bus Error exception 


Address Error exception 
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LWCz Load Word to Coprocessor z LWCz 


31 26 25 21 20 16 15 0 
LWCz base rt offset 
1100xx* 
6 5 5 16 
Format: 
LWCz rt, offset (base) (MIPS I format) 
Description: 


The 16-bit offset is sign extended and added to the contents of general-purpose 

register base to form a virtual address. The processor loads a word at the addressed 
memory location to general-purpose register rt of CPz. The manner in which each 
coprocessor uses the data is defined by the individual coprocessor specifications. 


If either of the low-order two bits of the address is not zero, an Address Error 
exception occurs. 


This instruction is not valid for use with CPO. 
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Load Word to Coprocessor z 
LWCz (continued) LWCz 


Operation: 


32 T:  vAddr < ((offset;s)'® || offset;s._09) + GPR[base] 
(pAddr, uncached) < AddressTranslation (vAddr, DATA) 
pAdar < pAddrpsiz_e-1...3 || (PAddro. 9 xor (ReverseEndian || 07)) 
mem < LoadMemory (uncached, WORD, pAddr, vAddr, DATA) 
byte < vAddro_9 xor (BigEndianCPU || 0°) 
COPzLW (byte, rt, mem) 

64 T:  vAddr < ((offset;s)*® || offset; 0) + GPR[base] 
(pAddr, uncached)<— AddressTranslation (vAddr, DATA) 
pAddr <— pAddrpsize-1...3 || (DAddro. 9 xor (ReverseEndian || 0°)) 
mem < LoadMemory (uncached, WORD, pAddr, vAddr, DATA) 
byte < vAddro_ g xor (BigEndianCPU || 0°) 
COPzLW (byte, rt, mem) 


Exceptions: 


TLB Miss exception 

TLB Invalid exception 

Bus Error exception 

Address Error exception 
Coprocessor Unusable exception 


Opcode Bit Encoding: 
LWCz Bit #31 30 29 28 27 26 0 
LWC1 
Bit #31 30 29 28 27 26 0 
LWC2 
Opcode  Coprocessor Number 
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Load Word Left LWL 


31 26 25 21 20 16 15 0 
LWL base rt offset 
100010 
6 5 5 16 
Format: 
LWL rt, offset (base) (MIPS I format) 
Description: 


This instruction is combined with the LWR instruction to load word data in 
memory that is not at a word boundary to general-purpose register rt. The LWL 
instruction loads the higher portion of the data to the register, while the LWR 
instruction loads the lower portion. 


The 16-bit offset is sign extended and added to the contents of general-purpose 
register base to generate a virtual address that can specify any byte. Of the word 
data in the memory where the most-significant byte is specified by the generated 
address, only the data at the same word boundary as the target address is loaded 
and stored to the higher portion of general-purpose register rt. The remaining 
portion of the register is not affected. Depending on the address specified, the 
number of bytes to be loaded changes from | to 4. 


In other words, first the addressed byte is stored to the most-significant byte 
position of general-purpose register rt. If there is data of the high-order byte that 
follows the same word boundary, the operation to store this data to the next byte 
of general-purpose register rt is repeated. 


The remaining higher byte is not affected. 


Memor dan 
Big Endlan) Register 
address 4 4 5 7 Before 
A B D 
addresso | 0 | 1 : 3 loading e $24 


LWL $24,1($0) 


After 
loading |_1 2| 3)| D | $24 
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LWL 


Load Word Left 
(continued) 


The contents of general-purpose register rt are bypassed within the processor, so 
that no NOP instruction is needed between an immediately preceding Load 
instruction that targets general-purpose register rt and a subsequent LWL (or 


LWR) instruction. 


The Address Error Exception does not occur, even if the specified address is not 


located at the word boundary. 


Operation: 

32. T:  vAddr < ((offsetys) '® || offsetys._0) + GPR[base] 
(pAddr, uncached) <— AddressTranslation (vAddr, DATA) 
pAddr < pAddrpgize-1..3 || (PAddro. 9 xor ReverseEndian’) 
if BigEndianMem = 0 then 

pAddr < pAddrpsize-1...2 || 0° 

endif 
byte < vAddr,_9 xor BigEndianCPU2 
word <— vAddro xor BigEndianCPU 
mem < LoadMemory (uncached, byte, pAddr, vAddr, DATA) 
temp — MeM32*word+8*byte+7 || GPR[rtlo3-g*byte...0 
GRP[rt] <— temp 

64 T:  vAddr < ((offsetys)*® || offsetys 0) + GPR[base] 


(pAddr, uncached) < AddressTranslation (vAddr, DATA) 
pAddr < pAddrpgize-1..3 || (PAddro. 9 xor ReverseEndian’) 
if BigEndianMem = 0 then 
pAddr < pAddrpgize-1...all 0° 
endif 
byte < vAddr,_9 xor BigEndianCPU2 
word <— vAddro xor BigEndianCPU 
mem < LoadMemory (uncached, byte, pAddr, vAddr, DATA) 


temp — MeM32*word+8*byte+7 || GPR[rtlo3-g*byte...0 
GPR{[rt] + (temp31)°" || temp 
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LW L (continued) LW L 


The relationship between the address given to the LWL instruction and the result 
(bytes for registers) is shown below: 


LWL 
Register A B Cc D E F G H 
Memory | J K L M|N O P 
BigEndianCPU = 0 BigEndianCPU = 1 
Offset Offset 
vAddro._o Destination Type| LEM|BEM Destination /Type|/LEM)/BEM 
0 SSSSPFGH| 0 0/7 |SSSSIJKL) 3 4/0 
1 SSSSOPGH| 1 0/6 ;|SSSSJKLH) 2 4/14 
2 SSSSNOPH| 2 0/5 |SSSSKLGH| 1 4/2 
3 SSSSMNOP} 3 0/4 |SSSSLFGH| 0 4 | 3 
4 SSSSLFGH| 0 41/3 |SSSSMNOP) 3 0| 4 
5 SSSSKLGH| 1 4/2 |SSSSNOPH; 2 0 | 5 
6 S§SSSJKLH| 2 4 | 1 SSSSOPGH| 1 0 | 6 
7 SSSSIJKL] 3 41/0 |SSSSPFGH, 0 0| 7 


Note: Type: Access type output to memory (refer to Table 16-3 on 
page 324 for information on byte access within a double- 
word) 


Offset. pAddr , Output to memory 
LEM Little-endian memory (BigEndianMem = 0) 
BEM Big-endian memory (BigEndianMem = 1) 


S: Sign extension of destination bit 31 
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LW L (continued) LW L 


Exceptions: 


TLB Miss exception 
TLB Invalid exception 
Bus Error exception 
Address Error exception 
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Load Word Right LWR 


31 26 25 21 20 16 15 0 
LWR base rt offset 
100110 
6 5 5 16 
Format: 
LWR rt, offset (base) (MIPS I format) 
Description: 


This instruction is combined with the LWL instruction to load the word data in the 
memory that is not at the word boundary to general-purpose register rt. The LWL 
instruction loads the higher portion of the data to the register, while the LWR 
instruction loads the lower portion. 


The 16-bit offset is sign extended and added to the contents of general-purpose 
register base to generate a virtual address that can specify any byte. Of the word 
data in the memory where the least-significant byte is specified by the generated 
address, only the data at the same word boundary as the target address is loaded 
and stored to the lower portion of general-purpose register rt. The remaining 
portion of the register is not affected. Depending on the address specified, the 
number of bytes to be loaded changes from | to 4. 


In other words, first the addressed byte is stored to the least-significant byte 
position of general-purpose register rt. If there is data of the high-order byte that 
follows the same word boundary, the operation to store this data to the next byte 
of general-purpose register rt is repeated. 


The remaining high-order byte is not affected. 


Memory 
(Big Endian) Register 
address 4 4 5 6 7 Before 
addressO | 0 1 a.) 8 foading(e es AB Gs DE et 
LWR $24,4($0) 
After 
loading A B Cc 4 $24 


Vr5432 Microprocessor User’s Manual 453 


Chapter 17 


LWR Load Word Right LWR 


(continued) 


The contents of general-purpose register rt are bypassed within the processor, so 
that no NOP instruction is needed between an immediately preceding Load 
instruction that targets general-purpose register rt and a following LDL (or LWR) 
instruction. 


The Address Error exception does not occur even if the specified address is not 
located at the word boundary. 


Operation: 


32 T:  vAddr < ((offset;s)'® || offsetys 0) + GPR[base] 
(pAddr, uncached) < AddressTranslation (vAddr, DATA) 
pAddr < pAddrpgjze-_1..3 || (pAddrs. 9 xor ReverseEndian’®) 
if BigEndianMem = 1 then 
pAddr < pAddrpgize-ai...3 || O° 
endif 
byte < vAddr,_9 xor BigEndianCPU2 
word < vAddro xor BigEndianCPU 
mem < LoadMemory (uncached, 0 || byte, pAddr, vAddr, DATA) 


temp <— memsg}__32-8*pyte...0 || MEM31432*word-32*word+8*byte 
GPR[rt] <— temp 


64 T:  vAddr < ((offset;s)*® || offsetys 0) + GPR[base] 
(pAddr, uncached) < AddressTranslation (vAddr, DATA) 
pAddr <— pAddrpsize-1...3 || (PAddro. 6 xor ReverseEndian®) 
if BigEndianMem = 1 then 
pAddr <- pAddrpsize-si...3 || 0° 
endif 
byte < vAddr, 9 xor BigEndianCPU? 
word < vAddro xor BigEndianCPU 
mem < LoadMemory (uncached, 0 || byte, pAddr, vAddr, DATA) 


temp <— memsg}__32-8*byte...0 |] MEM31432*word-32*word+8*byte 
GPR[rt] — (temp31)°" || temp 
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LWR 


Load Word Right 
(continued) 


CPU Instruction Set 


LWR 


The relationship between the address given to the LWR instruction and the result 
(bytes for registers) is shown below: 


LWR 
Register | A B C D E F G H 
Memory | J K L M N O P 
BigEndianCPU = 0 BigEndianCPU = 1 
Offset Offset 
vAddro Destination Type |LEM| BEM Destination /Type LEM|BEM 
0 SSSSMNOP| 3 0|/4 |\XXXXEFGI 0 7 | 0 
1 X XXXEMNO,> 2 1/4 |XX XXEFI J} 1 6 0 
2 XXXXEFMN) 1 2}4 |XX XXEI J K} 2 5 0 
3 XXXXEFGM 0O 3/4 |SSSSIJSKL] 3 4] 0 
4 SSSSIJKL]| 3 4/0 |XX XXEFGM 0 3] 4 
5 XXXXEI JK) 2 51/0 |XXXXEFMN] 1 2/4 
6 XXXXEFI J 1 6/0 |XX XXEMNO} 2 1 4 
7 XXXXEF GI 0 7/0 |SSSSMNOP} 3 0| 4 
Note: Type: Access type output to memory (refer to Table 16-3 on 


Vr5432 Microprocessor User’s Manual 


page 324 for information on byte access within a double- 


word) 


OffsetpAddr _, Output to memory 


LEMLittle-endian memory (BigEndianMem = 0) 
BEMBig-endian memory (BigEndianMem = 1) 


Sign extension of destination bit 31 


Not affected 
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LWR Load Word Right LWR 


(continued) 


Exceptions: 


TLB Miss exception 
TLB Invalid exception 
Bus Error exception 
Address Error exception 
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LWU 


Load Word Unsigned 


CPU Instruction Set 


LWU 


31 26 25 21 20 16 15 0 
LWU base rt offset 
101111 
6 5 5 16 
Format: 
LWU tt, offset (base) (MIPS III format) 
Description: 
The 16-bit offset is sign extended and added to the contents of the general-purpose 
register base to form a virtual address. The contents of the word at the memory 
location specified by the address are loaded into general-purpose register rt. The 
loaded word is zero-extended in 64-bit mode. 
If either of the low-order two bits of the effective address is not zero, an Address 
Error exception occurs. 
This operation is defined in 64-bit mode and 32-bit Kernel mode. Execution of this 
instruction in 32-bit User or Supervisor mode causes a Reserved Instruction 
exception. 
Operation: 
32 TT:  vAddr< ((offset,s)'® || offset,s 9) + GPR[base] 
(pAddr, uncached) < AddressTranslation (vAddr, DATA) 
mem < LoadMemory (uncached, WORD, pAddr, vAddr, DATA) 
GPR[rt] <— mem 
64 TT:  vAddr < ((offset;s)*® || offsetys 9) + GPR[base] 


(pAddr, uncached) < AddressTranslation (vAddr, DATA) 
mem < LoadMemory (uncached, WORD, pAddr, vAddr, DATA) 
GPR[rt] < 0°? || mem 


Note: In 32-bit Kernel mode, the high-order 32 bits are ignored during 
virtual address creation. 
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Chapter 17 


LW U Load Word Unsigned LW U 


(continued) 


Exceptions: 


TLB Miss exception 

TLB Invalid exception 

Bus Error exception 

Address Error exception 
Reserved Instruction exception 
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CPU Instruction Set 


MACC psig MACC 


31 26 25 21 20 16 15 1110 0 
SPECIAL rs rt rd MACC 
000000 00101011000 

6 5 5 5 11 
Format: 
MACC rd, rs, rt (VR5432 format) 
Description: 


The signed 32-bit operands in the rs and rt registers are multiplied, and the product 
is added to the signed contents of the 64-bit accumulator formed by the least- 
significant 32 bits of the HI and LO registers. A copy of the least-significant 32 
bits of the result is stored in general-purpose register rd. 


If either of the two instructions immediately preceding this instruction is the 
MFHI or MFLO instruction, the execution result of the transfer instruction is 
undefined. To obtain correct results, insert two or more other instructions between 
MFHI or MFLO and the MACC instruction. 


Operation: 


32, 64 T: — Hlg4_.9 || LO31..0 — (Alg1._.0 |] LOg1...0) + (GPR[rs] * GPRI[rt]) 
GPRIrd]31_.0 < ((HI31...9 || LO31...0) + (GPR[rs] * GPR[rt]))31..0 


Exceptions: 


None 
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MACCHI uaeaeniisoairai MACCHI 


31 26 25 21 20 16 15 1110 0 
SPECIAL rs rt rd MACCHI 
000000 01101011000 

6 5 5 5 11 
Format: 
MACCHI rd, rs, rt (VR5432 format) 
Description: 


The signed 32-bit operands in the rs and rt registers are multiplied, and the product 
is added to the signed contents of the 64-bit accumulator formed by the least- 
significant 32 bits of the HI and LO registers. A copy of the most-significant 32 
bits of the result is stored in general-purpose register rd. 


If either of the two instructions immediately preceding this instruction is the 
MFHI or MFLO instruction, the execution result of the transfer instruction is 
undefined. To obtain correct results, insert two or more other instructions between 
MFHI or MFLO and the MACCHI instruction. 


Operation: 


32, 64 T: —— Hlg4_9 || LO31...0 — (Alg1._.0 |] LOg1...0) + (GPR[rs] * GPRI[rt]) 
GPR[rd]31_.0 — (Hl31._.9 || LO31__.0) + (GPRIrs] * GPR[rt]))¢3..32 


Exceptions: 


None 
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CPU Instruction Set 


MACCHIU Pee aie MACCHIU 


and Move HI 
31 26 25 21 20 16 15 1110 0) 
SPECIAL rs rt rd MACCHIU 
000000 01101011001 
6 5 5 5 11 
Format: 
MACCHIU rd, rs, rt (VR5432 format) 
Description: 


The unsigned 32-bit operands in the rs and rf registers are multiplied, and the 
product is added to the unsigned contents of the 64-bit accumulator formed by the 
least-significant 32 bits of the HI and LO registers. A copy of the most-significant 
32 bits of the result is stored in general-purpose register rd. 


If either of the two instructions immediately preceding this instruction is the 
MFHI or MFLO instruction, the execution result of the transfer instruction is 
undefined. To obtain correct results, insert two or more other instructions between 
MFHI or MFLO and the MACCHIU instruction. 


Operation: 


32, 64 T: — Hlg4_.9 || LO31...0 — (Alg1._.0 |] LOs1...0) + (GPR[rs] * GPRI[rt]) 
GPR[rd]31..0 — (Hl34._.9 || LO31__.0) + (GPRIrs] * GPR[rt]))¢3..32 


Exceptions: 


None 
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Unsigned Multiply, 
MACCU Recuntiate Sid ae LO MACCU 


31 26 25 21 20 16 15 1110 0 
SPECIAL rs rt rd MACCU 
000000 00101011001 

6 5 5 5 11 
Format: 
MACCU rd, rs, rt (VR5432 format) 
Description: 


The unsigned 32-bit operands in the rs and rt registers are multiplied, and the 
product is added to the unsigned contents of the 64-bit accumulator formed by the 
least-significant 32 bits of the HI and LO registers. A copy of the least-significant 
32 bits of the result is stored in general-purpose register rd. 


If either of the two instructions immediately preceding this instruction is the 
MFHI or MFLO instruction, the execution result of the transfer instruction is 
undefined. To obtain correct results, insert two or more other instructions between 
MFHI or MFLO and the MACCU instruction. 


Operation: 


32, 64 T: — Algy_.0 |] LO3t..0  (Ala1...0 || LO31...0) + (GPRIrs] * GPR[rt]) 
GPR[rd]31_.9 — ((Hl31,_.9 || LO31...0) + (GPR[rs] * GPR[rt]))31..0 


Exceptions: 


None 
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CPU Instruction Set 


Move from 
M FCO System Control Coprocessor M FCO 


31 26 25 21 20 16 15 1110 0 
COPO MF rt rd 0 
010000 ; 00000 00000000000 
6 5 5 5 11 
Format: 
MEFCO rt, rd (MIPS I format) 
Description: 


The contents of general-purpose register rd of the CPO are loaded into general- 
purpose register rt. 


Operation: 


32 = T: data< CPR[O,rd] 
T+1: GPRirt] < data 


64 TT: data< CPR[0,rd] 
T+1: GPR[rt] < (datas;)°? || datas;_0 


Exceptions: 


Coprocessor Unusable exception (64-/32-bit User and Supervisor mode if CPO is 
disabled) 
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M FC Z Move from Coprocessor z M FC Z 


31 26 25 21 20 16 15 11 10 0 
COPz MF rt rd 0 
0100xx*;} 00000 00000000000 
6 5 5 5 11 
Format: 
MFCz rt, rd (MIPS I format) 
Description: 


The contents of general-purpose register rd of CPz are loaded into general- 
purpose register rt. 


Operation: 

32 T: data <— CPR[z,rd] 
T+1: GPR[rt] < data 

64 T: if rdg = 0 then 


data <~ CPR[Zz, rd4__4 || O]31...0 
else 
data <~ CPRIZ, rdq..4 || Oles...32 
endif 
T+1: GPR[rt] — (datas;)°? || data 


Exceptions: 


Coprocessor Unusable exception 
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CPU Instruction Set 


Move from Coprocessor z 
M FCz (continued) M FCz 


Opcode Bit Encoding: 


MFCz Bit #31 30 29 28 27 26 25 24 23 22 21 0 
MFCO 


Bit #31 30 29 28 27 26 25 24 23 22 21 0 
MFC1 


Bit #31 30 29 28 27 26 25 24 23 22 21 0 
MFC2 


Opcode 


= Coprocessor Sub-opcode 
Coprocessor Number 


Vr5432 Microprocessor User’s Manual 465 


Chapter 17 


Move from 
M F D R Debug Register M F D R 


31 26 25 21 20 16 15 1110 65 0 
SPECIAL2 MFDR rt dr 0 Debug Move 
011100 | 00000 00000 111101 

6 5 5 5 5 6 
Format: 
MEDR tt, dr (VR5432 format) 
Description: 


The contents of debug register dr are loaded into general-purpose register rt. 


Operation: 


32,64T: GPR[rt] < DEBUG|dr] 


Exceptions: 


None 
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MFHI 


CPU Instruction Set 


Move from HI M FH | 
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31 26 25 16 15 1110 6 5 0 
SPECIAL 0 rd 0 MFHI 
000000 | 0000000000 00000 010000 
6 10 5 5 6 

Format: 
MFHI rd (MIPS I format) 

Description: 
The contents of special register HI are loaded into general-purpose register rd. 
To ensure proper operation in the event of interrupts, the two instructions that 
follow an MFHI instruction may not be any of the instructions that modify the HI 
register: DDIV, DDIVU, DIV, DIVU, DMULT, DMULTU, MAC, MACC, 
MACCHI, MACCHIU, MACCU, MTHI, MUL, MULHI, MULHIU, MULT, 
MULTU, or MULU. 

Operation: 

32, 64 T: GPR[rd] < HI 

Exceptions: 
None 
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MELO Move from LO MELO 


31 26 25 16 15 1110 6 5 0 
SPECIAL 0 rd 0 MFLO 
000000 | 0000000000 00000 010010 
6 10 5 5 6 

Format: 
MELO rd (MIPS I format) 
Description: 


The contents of special register LO are loaded into general-purpose register rd. 


To ensure proper operation in the event of interruptions, the two instructions that 
follow an MFLO instruction may not be any of the instructions that modify the L 
register: DDIV, DDIVU, DIV, DIVU, DMAC, DMULT, DMULTU, MAC, 
MACC, MACCHI, MACCHIU, MACCU, MTLO, MUL, MULHI, MULHIU, 
MULT, MULTU, or MULU. 


Operation: 


32, 64 T: GPR{[rd] — LO 


Exceptions: 


None 
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CPU Instruction Set 


Move from 
M F PC Performance Counter M F P C 


31 26 25 21 20 16 15 11 10 65 10 
COPO MFPC rt CPO Move 0 reg 1 
010000 00000 11001 00000 
6 5 5 5 5 5 1 
Format: 
MFPC rt, reg (VR5432 format) 
Description: 


The contents of Performance Counter reg are loaded into general-purpose register 
rt. 


Operation: 


32,64 T: GPR[rt] < CPR[0,reg] 


Exceptions: 


Coprocessor Unusable exception 
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Move from 
M F PS Performance Event Specifier M F PS 
31 26 25 21 20 16 15 11 10 65 10 
COPO MFPS rt CPO Move 0 reg 0 
010000 00000 11001 00000 
6 5 5 5 5 5 1 
Format: 
MFPS rt, reg (VR5432 format) 
Description: 


The contents of performance event specifier reg are loaded into general-purpose 


register rt. 


Operation: 


32,64 T: GPR[rt] < CPR[0,reg] 


Exceptions: 


Coprocessor Unusable exception 


470 Vr5432 Microprocessor User’s Manual 


CPU Instruction Set 


MOVN Move Conditional on Not Zero MOVN 


31 26 25 21 20 1615 1110 65 0 
SPECIAL rs rt rd 0 MOVN 
000000 00000 |} 001011 

6 5 5 5 5 6 
Format: 
MOVN rd, rs, rt (MIPS IV format) 
Description: 


If the value in general-purpose register rt is not equal to zero, then the contents of 
general-purpose register rs are placed into general-purpose register rd. 


Operation: 


if GPR[rt] 4 0 then 
GPR[rd] < GPR[rs] 
endif 


Note: The nonzero value tested here is the condition true result from the 
SLT, SLTI, SLTU, and SLTIU comparison instructions. 


Exceptions: 


Reserved Instruction exception 
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M OVZ Move Conditional on Zero M OVZ 


31 26 25 21 20 1615 1110 65 0 
SPECIAL rs rt rd 0 MOVZ 
000000 00000 | 001010 

6 5 5 5 5 6 
Format: 
MOVZ rd, rs, rt (MIPS IV format) 
Description: 


If the value in general-purpose register rt is equal to zero, then the contents of 
general-purpose register rs are placed into general-purpose register rd. 


Operation: 


if GPR[rt] = 0 then 
GPR[rd] <— GPRirs] 
endif 


Note: The nonzero value tested here is the condition false result from the 
SLT, SLTI, SLTU, and SLTIU comparison instructions. 


Exceptions: 


Reserved Instruction exception 
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CPU Instruction Set 


MSAC Multiply, Negate, MSAC 


Accumulate, and Move LO 


31 26 25 21 20 16 15 1110 0 
SPECIAL rs rt rd MSAC 
000000 00111011000 

6 5 5 5 11 
Format: 
MSAC rd, rs, rt (VR5432 format) 
Description: 


The signed 32-bit operands in the rs and rt registers are multiplied, and the product 
is subtracted from the signed contents of the 64-bit accumulator formed by the 
least-significant 32 bits of the HI and LO registers. A copy of the least-significant 
32 bits of the result is stored in general-purpose register rd. 


If either of the two instructions immediately preceding this instruction is the 
MFHI or MFLO instruction, the execution result of the transfer instruction is 
undefined. To obtain correct results, insert two or more other instructions between 
MFHI or MFLO and the MSAC instruction. 


Operation: 


32, 64 T: —— Hlg4_.9 {| LO31...0 — (Hlg1._.0 |] LOs1...0) - (GPRI[rs] * GPR[rt]) 
GPR[rd]31_.0 — (HI31__.9 || LOgy... ) - (GPR[rs] * GPR[rt]))31..0 


Exceptions: 


None 
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Multiply, Negate, 
MSACHI Accumulate, and Move HI MSACHI 


31 26 25 21 20 16 15 1110 0 
SPECIAL rs rt rd MSACHI 
000000 01111011000 

6 5 5 5 11 
Format: 
MSACHI rd, rs, rt (VR5432 format) 
Description: 


The signed 32-bit operands in the rs and rt registers are multiplied, and the product 
is subtracted from the signed contents of the 64-bit accumulator formed by the 
least-significant 32 bits of the HI and LO registers. A copy of the most-significant 
32 bits of the result is stored in general-purpose register rd. 


If either of the two instructions immediately preceding this instruction is the 
MFHI or MFLO instruction, the execution result of the transfer instruction is 
undefined. To obtain correct results, insert two or more other instructions between 
MFHI or MFLO and the MSACHI instruction. 


Operation: 


32, 64 T: —— Hlg4_.9 [|] LO31...0 — (Hlg1._.0 |] LOs1...0) - (GPRI[rs] * GPRI[rt]) 
GPR[rd]31_.0 <— (HI31...9 || LOgy... ) - (GPR[rs] * GPR[rt]))63..32 


Exceptions: 


None 
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CPU Instruction Set 


MSACHIU a MSACHIU 


and Move HI 
31 26 25 21 20 16 15 1110 0 
SPECIAL rs rt rd MSACHIU 
000000 01111011001 
6 5 5 5 11 
Format: 
MSACHIU rd, rs, rt (VR5432 format) 
Description: 


The unsigned 32-bit operands in the rs and rt registers are multiplied, and the 
product is subtracted from the unsigned contents of the 64-bit accumulator formed 
by the least-significant 32 bits of the HI and LO registers. A copy of the most- 
significant 32 bits of the result is stored in general-purpose register rd. 


If either of the two instructions immediately preceding this instruction is the 
MFHI or MFLO instruction, the execution result of the transfer instruction is 
undefined. To obtain correct results, insert two or more other instructions between 
MFHI or MFLO and the MSACHIU instruction. 


Operation: 


32, 64 T: — HAllg4..9 || LO31..0 — (l31...0 |] LO31...0) - (GPR[rs] * GPR[rt]) 
GPR[rd]31.0 <— ((HI31..0 || LO31.. ) = (GPRirs] * GPRIrt]))63. 32 


Exceptions: 


None 


Vr5432 Microprocessor User’s Manual 475 


Chapter 17 


MSACU Unsigned Multiply, Negate, MSACU 


Accumulate, and Move LO 


31 26 25 21 20 16 15 1110 0 
SPECIAL rs rt rd MSACU 
000000 00111011001 

6 5 5 5 11 
Format: 
MSACU rd, rs, rt (VR5432 format) 
Description: 


The unsigned 32-bit operands in the rs and rf registers are multiplied, and the 
product is subtracted from the unsigned contents of the 64-bit accumulator formed 
by the least-significant 32 bits of the HI and LO registers. A copy of the least- 
significant 32 bits of the result is stored in general-purpose register rd. 


If either of the two instructions immediately preceding this instruction is the 
MFHI or MFLO instruction, the execution result of the transfer instruction is 
undefined. To obtain correct results, insert two or more other instructions between 
MFHI or MFLO and the MSACU instruction. 


Operation: 


32, 64 T: —— Hlg4_.9 [|] LO31...0 — (Hlg1._.0 |] LOg1...0) - (GPRI[rs] * GPRI[rt]) 
GPR[rd]31_.0 — (Hl31._.9 || LOgy... ) - (GPR[rs] * GPRI[rt]))31..0 


Exceptions: 


None 
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CPU Instruction Set 


Move to 
MTCO System Control Coprocessor MTCO 


31 26 25 21 20 16 15 1110 0 
COPO MT rt rd 0 
010000 00100 00000000000 
6 5 5C 5 11 


Format: 
MTCO rt, rd (MIPS I format) 


Description: 


The contents of general-purpose register rt are loaded into general-purpose 
register rd of CPO. 


Because the contents of the TLB may be altered by this instruction, the operation 
of Load and Store instructions and TLB operations for the instructions 
immediately before and after this instruction are undefined. 


Operation: 

32,64 iT: data <— GPR[rt] 
T+1: CPR[O, rd] < data 

Exceptions: 


Coprocessor Unusable exception 


Vr5432 Microprocessor User’s Manual 477 


Chapter 17 


MTCz Move to Coprocessor z M TC Z 
0 


31 26 25 21 20 16 15 11 10 
COPz MT rt rd 0 
0100xx*|} 00100 00000000000 
6 5 5 5 11 
Format: 
MTCz rt, rd (MIPS I format) 
Description: 
The contents of general-purpose register rt are loaded into general-purpose 
register rd of CPz. 
Operation: 
32 =T: data < GPRirt] 
T+1: CPR[z, rd] < data 
64 T: data< GPRi[rt]31 0 
T+1: if rdpg = 0 
CPRIZ, rd4__4 || 0] < CPRIZz, rq... || Oles...32 || data 
else 
CPRIZz, rdq_.4 || 0] <— data || CPR[z, rd4__4 || O31... 
endif 
Opcode Bit Encoding: 
MTCz Bit #31 30 29 28 27 26 25 24 23 22 21 0 
MTCO 
Bit #31 30 29 28 27 26 25 24 23 22 21 0 
MTC1 
Bit #31 30 29 28 27 26 25 24 23 22 21 0 
MTC2 
IT 
Opcode 
e = Coprocessor Sub-opcode 
Coprocessor Number 


Exceptions: 


Coprocessor Unusable exception 
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MT D R sane Beaisiai MT D R 


31 26 25 21 20 16 15 11 10 65 0 
SPECIAL2 MTDR rt dr 0 Debug Move 
011100 ; 00100 00000 111101 

6 5 5 5 5 6 
Format: 
MTDR rt, dr (VR5432 format) 
Description: 


The contents of general-purpose register rt are loaded into debug register dr. 


Operation: 


32,64T: | DEBUG[dr] < GPRIrt] 


Exceptions: 


None 
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MTHI Move to HI 


MTHI 


31 26 25 21 20 65 0 
SPECIAL rs 0 MTHI 
000000 000000000000000 010001 
6 5 15 6 
Format: 
MTHI rs (MIPS I format) 
Description: 


The contents of general-purpose register rs are loaded into special register HI. 


If the MTHI instruction is executed following the MULT, MULTU, DIV, or 
DIVU instruction, the operation is performed normally. However, if the MFLO, 
MFHI, MTLO, or MTHI instruction is executed following the MTHI instruction, 


the contents of special register LO are undefined. 


Operation: 
32,64 T-2: HI < undefined 
T-1: HI < undefined 
Te HI <— GPR{rs] 
Exceptions: 
None 
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CPU Instruction Set 


MTLO Move 190 MTLO 


31 26 25 2120 65 0 
SPECIAL rs 0 MTLO 
000000 000000000000000 010011 
6 5 15 6 
Format: 
MTLOrs (MIPS I format) 
Description: 


The contents of general-purpose register rs are loaded into special register LO. 


If the MTLO instruction is executed following the MULT, MULTU, DIV, or 
DIVU instruction, the operation is performed normally. However, if the MFLO, 
MFHI, MTLO, or MTH1 instruction is executed following the MTLO instruction, 
the contents of special register HI are undefined. 


Operation: 


32,64 T-2: LO < undefined 
T-1: LO < undefined 
T: LO «+ GPRIrs] 


Exceptions: 


None 
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Move to 
MTPC Performance Counter MTPC 


31 26 25 21 20 16 15 11 10 65 10 
COPO MTPC rt CPO Move 0 reg 1 
010000 00100 11001 00000 
6 5 5 5 5 5 1 
Format: 
MTPC rt, reg (VR5432 format) 
Description: 


The contents of general-purpose register rt are loaded into Performance Counter 
reg. 


Operation: 


32,64 T: CPR[0,reg] < GPRirt] 


Exceptions: 


Coprocessor Unusable exception 
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Move to 
MT PS Performance Event Specifier MTPS 


31 26 25 21 20 16 15 11 10 65 10 
COPO MTPS rt CPO Move 0 reg 0 
010000 00100 11001 00000 
6 5 5 5 5 5 1 
Format: 
MTPS rt, reg (VR5432 format) 
Description: 


The contents of general-purpose register rt are loaded into performance event 
specifier reg. 


Operation: 


32,64 T: CPR[0,reg] < GPRirt] 


Exceptions: 


Coprocessor Unusable exception 
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MUL Multiply and Move LO MUL 


31 26 25 21 20 16 15 1110 0 
SPECIAL rs rt rd MUL 
000000 00001011000 
6 5 5 5 11 
Format: 
MUL rd, rs, rt (VR5432 format) 
Description: 


The signed 32-bit operands in the rs and rt registers are multiplied, and the product 
is stored in the 64-bit register formed by the least-significant 32 bits of the HI and 
LO registers. A copy of the least-significant 32 bits of the result is stored in 
general-purpose register rd. 


An Integer Overflow exception never occurs. 


If either of the two instructions immediately preceding this instruction is the 
MFHI or MFLO instruction, the execution result of the transfer instruction is 
undefined. To obtain correct results, insert two or more other instructions between 
MFHI or MFLO and the MUL instruction. 


Operation: 


32, 64 T: — Hls4_9 || LO31..9 < GPRIrs] * GPR[rt] 
GPRIrd]31_9 < (GPRIrs] * GPR[rt])31.0 


Exceptions: 


None 
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MULHI Multiply and Move HI MULHI 


31 26 25 21 20 16 15 1110 0 
SPECIAL rs rt rd MULHI 
000000 01001011000 

6 5 5 5 11 
Format: 
MULHI rd, rs, rt (VR5432 format) 
Description: 


The signed 32-bit operands in the rs and rt registers are multiplied, and the product 
is stored in the 64-bit register formed by the least-significant 32 bits of the HI and 
LO registers. A copy of the most-significant 32 bits of the result is stored in 
general-purpose register rd. 


An Integer Overflow exception never occurs. 


If either of the two instructions immediately preceding this instruction is the 
MFHI or MFLO instruction, the execution result of the transfer instruction is 
undefined. To obtain correct results, insert two or more other instructions between 
MFHI or MFLO and the MULHI instruction. 


Operation: 


32, 64 T: — Hls4_.9 || LO31..9 < GPRIrs] * GPR[rt] 
GPR[rd]31..9 <— (GPR[rs] * GPR[rt])63, 30 


Exceptions: 


None 
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MULHIU me irr in MULHIU 


31 26 25 21 20 16 15 1110 0 
SPECIAL rs rt rd MULHIU 
000000 01001011001 

6 5 5 5 11 
Format: 
MULHIU rd, rs, rt (VR5432 format) 
Description: 


The unsigned 32-bit operands in the rs and rf registers are multiplied, and the 
product is stored in the 64-bit register formed by the least-significant 32 bits of the 
HI and LO registers. A copy of the most-significant 32 bits of the result is stored 
in general-purpose register rd. 


If either of the two instructions immediately preceding this instruction is the 
MFHI or MFLO instruction, the execution result of the transfer instruction is 
undefined. To obtain correct results, insert two or more other instructions between 
MFHI or MFLO and the MULHIU instruction. 


Operation: 


32, 64 T: — Hlg4_9 || LO31_.9 < GPRIrs] * GPR[rt] 
GPRI[rd]31_.9 < (GPR[rs] * GPR[rt])¢3, 30 


Exceptions: 


None 


486 Vr5432 Microprocessor User’s Manual 


CPU Instruction Set 


M U LS Multiply, Negate, and Move LO M U LS 


31 26 25 21 20 16 15 1110 0 
SPECIAL rs rt rd MULS 
000000 00011011000 

6 5 5 5 11 
Format: 
MULS rd, rs, rt (VR5432 format) 
Description: 


The signed 32-bit operands in the rs and rt registers are multiplied, and the product 
is negated and stored in the 64-bit register formed by the least-significant 32 bits 
of the HI and LO registers. A copy of the least-significant 32 bits of the result is 
stored in general-purpose register rd. 


An Integer Overflow exception never occurs. 


If either of the two instructions immediately preceding this instruction is the 
MFHI or MFLO instruction, the execution result of the transfer instruction is 
undefined. To obtain correct results, insert two or more other instructions between 
MFHI or MFLO and the MULS instruction. 


Operation: 


32,64. T: — Hlg4.9 || LO31..9 < 0- (GPRIrs] * GPR[r]) 
GPR[rd]s1..9 < (0 — (GPR[rs] * GPR[rt]))31..0 


Exceptions: 


None 
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MULSHI a le MULSHI 


31 26 25 21 20 16 15 1110 0 
SPECIAL rs rt rd MULSHI 
000000 01011011000 

6 5 5 5 11 
Format: 
MULSHI rd, rs, rt (VR5432 format) 
Description: 


The signed 32-bit operands in the rs and rt registers are multiplied and the product 
is negated and stored in the 64-bit register formed by the least-significant 32 bits 
of the HI and LO registers. A copy of the most-significant 32 bits of the result is 
stored in general-purpose register rd. 


An Integer Overflow exception never occurs. 


If either of the two instructions immediately preceding this instruction is the 
MFHI or MFLO instruction, the execution result of the transfer instruction is 
undefined. To obtain correct results, insert two or more other instructions between 
MFHI or MFLO and the MULSHI instruction. 


Operation: 


32, 64 T: — Hlgq_9 || LO31..9 < 0- (GPRIrs] * GPRIrt]) 
GPRIrd]31..9 <— (0 — (GPR[rs] * GPR[rt]))¢3. 32 


Exceptions: 


None 
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MULSHIU ee ie a MULSHIU 


31 26 25 21 20 16 15 1110 0 
SPECIAL rs rt rd MULSHIU 
000000 01011011001 

6 5 5 5 11 
Format: 
MULSHIU rd, rs, rt (VR5432 format) 
Description: 


The unsigned 32-bit operands in the rs and rf registers are multiplied and the 
product is negated and stored in the 64-bit register formed by the least-significant 
32 bits of the HI and LO registers. A copy of the most-significant 32 bits of the 
result is stored in general-purpose register rd. 


If either of the two instructions immediately preceding this instruction is the 
MFHI or MFLO instruction, the execution result of the transfer instruction is 
undefined. To obtain correct results, insert two or more other instructions between 
MFHI or MFLO and the MULSHIU instruction. 


Operation: 


32,64. = T: ~—_ Hil34..9 || LOg1,.9 < O- (GPRIrs] * GPRIrt]) 
GPR[rd]31_.0 — (0 — (GPRI[rs] * GPRI[rt]))63..32 


Exceptions: 


None 
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Unsigned Multiply, 
M U LS U sence and ages i. M U LS U 


31 26 25 21 20 16 15 1110 0 
SPECIAL rs rt rd MULSU 
000000 00011011001 

6 5 5 5 11 
Format: 
MULSU rd, rs, rt (VR5432 format) 
Description: 


The unsigned 32-bit operands in the rs and rf registers are multiplied and the 
product is stored in the 64-bit register formed by the least-significant 32 bits of the 
HI and LO registers. A copy of the least-significant 32 bits of the result is stored 
in general-purpose register rd. 


If either of the two instructions immediately preceding this instruction is the 
MFHI or MFLO instruction, the execution result of the transfer instruction is 
undefined. To obtain correct results, insert two or more other instructions between 
MFHI or MFLO and the MULSU instruction. 


Operation: 


32,64  T: — Hlsq.9 || LO31..9 < 0- (GPRIrs] * GPRI[rt]) 
GPR[rd]s1..9 < (0 — (GPR[rs] * GPR[rt]))31..0 


Exceptions: 


None 
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Multiply M U LT 


31 26 25 21 20 16 15 6 5 0 
SPECIAL rs rt 0 MULT 
000000 0000000000 011000 

6 5 5 10 6 
Format: 
MULT rs, rt (MIPS I format) 
Description: 


The contents of general-purpose registers rs and rt are multiplied, treating both 
operands as 32-bit signed integers. An Integer Overflow exception never occurs. 


In 64-bit mode, the operands must be valid 32-bit, sign-extended values. 


When the operation completes, the low-order word of the doubleword result is 
loaded into special register LO, and the high-order word of the doubleword result 
is loaded into special register HI. In the 64-bit mode, the respective results are sign 
extended and stored. 


If either of the two instructions immediately preceding this instruction is the 
MFHI or MFLO instruction, the execution result of the transfer instruction is 
undefined. To obtain correct results, insert two or more other instructions between 
MFHI or MFLO and the MULT instruction. 
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MULT 


Multiply 
(continued) 


MULT 


492 


Operation: 
32 T-2: LO < undefined 
HI < undefined 
T-1: LO < undefined 
HI < undefined 
T: ot < GPR[rs] * GPR[rt] 
LO — tg1..0 
H | = t63...32 
64 T-2: LO < undefined 
HI < undefined 
T-1: LO < undefined 
HI < undefined 
T: t <— GPAlrs}31_.0 “ GPR[rils1._.0 
LO e (t31) 35 || t31...0 
HI — (tg3)"* || te3...32 
Exceptions: 
None 
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M U LTU Unsigned Multiply M U LTU 


31 26 25 21 20 16 15 6 5 0 
SPECIAL rs rt 0 MULTU 
000000 0000000000 011001 
6 5 5 10 6 
Format: 
MULTUrs, rt (MIPS I format) 
Description: 


The contents of general-purpose registers rs and rt are multiplied, treating both 
operands as 32-bit unsigned values. An Integer Overflow exception never occurs. 


In 64-bit mode, the operands must be valid 32-bit, sign-extended values. 


When the operation completes, the low-order word of the doubleword result is 
loaded into special register LO, and the high-order word of the doubleword result 
is loaded into special register HI. In 64-bit mode, these results are sign extended 
and loaded. 


If either of the two preceding instructions is MFHI or MFLO, the execution results 
of these transfer instructions are undefined. To obtain the correct result, insert two 
or more additional instructions between MFHI or MFLO and the MULTU 
instruction. 
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MULTU 


Operation: 


32 T-2: LO 


64 T-2: LO 


Exceptions: 


None 


494 


Unsigned Multiply 


MULTU 


(continued) 


< undefined 

< undefined 

< undefined 

< undefined 

<— (0 || GPR[rs]) * (0 || GPR[rt]) 
1571.0 

— te3...32 


< undefined 

< undefined 

< undefined 

< undefined 

<= (0 || GPRI[rs]31 ...0) * (0 || GPR[rt]31._.0) 
= (t31) 35 || t31...0 

<— (tg3)°* || te3...32 
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M U L U Unsigned Multiply and Move LO M U L U 


31 26 25 21 20 16 15 1110 0 
SPECIAL rs rt rd MULU 
000000 00001011001 

6 5 5 5 11 
Format: 
MULU rd, rs, rt (VR5432 format) 
Description: 


The unsigned 32-bit operands in the rs and rf registers are multiplied and the 
product is stored in the 64-bit register formed by the least-significant 32 bits of the 
HI and LO registers. A copy of the least-significant 32 bits of the result is stored 
in general-purpose register rd. 


If either of the two instructions immediately preceding this instruction is the 
MFHI or MFLO instruction, the execution result of the transfer instruction is 
undefined. To obtain correct results, insert two or more other instructions between 
MFHI or MFLO and the MULU instruction. 


Operation: 


32, 64 T: — Hg4_9 || LO31_.9 < GPRIrs] * GPR[rt] 
GPRIrd]31_9 < (GPRIrs] * GPRIrt])31 0 


Exceptions: 


None 
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NOR NOR NOR 
31 26 25 21 20 16 15 11 10 6 5 
SPECIAL rs rt rd 0 NOR 
000000 00000 100111 
6 5 5 5 5 6 
Format: 
NOR rd, rs, rt (MIPS I format) 

Description: 


The contents of general-purpose register rs are bitwise NORed with the contents 
of general-purpose register rt. The result is stored in general-purpose register rd. 


Operation: 


32, 64 


T: GPR{[rd] < GPR[rs] nor GPR[rt] 


Exceptions: 


None 
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OR oF OR 


31 26 25 21 20 16 15 11 10 6 5 0 
SPECIAL rs rt rd 0 OR 
000000 00000 100101 
6 5 5 5 5 6 
Format: 
OR rd, rs, rt (MIPS I format) 
Description: 


The contents of general-purpose register rs are bitwise ORed with the contents of 
general-purpose register rt. The result is stored in general-purpose register rd. 


Operation: 


32, 64 T: GPRi[rd] < GPR[rs] or GPR{[rt] 


Exceptions: 


None 
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ORI OR Immediate ORI 


31 26 25 21 20 16 15 0) 
ORI rs rt immediate 
001101 
6 5 5 16 
Format: 
ORI rt, rs, immediate (MIPS I format) 
Description: 


The 16-bit immediate is zero extended and bitwise ORed with the contents of 
general-purpose register rs. The result is stored in general-purpose register rt. 


Operation: 


32 T: GPR[rt] — GPR[rs]31.. 146 || (immediate or GPR[rs]15__ 0) 
64 T: GPRirt]< GPRI[rs]¢3.. 16 || (immediate or GPR[rs];5__ 0) 


Exceptions: 


None 
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PREF prefetch PREF 


31 26 25 21 20 1615 0 
PREF base hint offset 
110011 
6 5 5 16 
Format: 
PREF hint, offset (base) (MIPS IV format) 
Description: 


PREF adds the 16-bit signed offset to the contents of general-purpose register base 
to form an effective byte address. It advises that data at the effective address may 
be used in the near future. The hint field supplies information about the way the 
data is expected to be used. 


Unlike the Vr5000, in which the PREF instruction is executed as an NOP, the 
VR5432 data may be prefetched into the data cache as a result of executing this 
instruction. 


PREF is an advisory instruction that may change the performance of the program. 
However, for all hint values and all effective addresses, it neither changes the 
architecturally visible state nor alters the meaning of the program. 


If MIPS IV instructions are supported and enabled, PREF does not cause 
addressing-related exceptions. If it does happen to raise an exception condition, 
the exception condition is ignored. If an addressing-related exception condition is 
raised and ignored, no data is prefetched. However, even if no data is prefetched, 
some action that is not architecturally visible—such as write-back of a dirty cache 
line—can take place. 


If PREF results in a memory operation, the memory access type used for the 
operation is determined by the memory access type of the effective address, just 
as it would be if the memory operation had been caused by a load or store to the 
effective address. 


The hint field supplies information about the way the data is expected to be used. 
A hint value cannot cause an action to modify an architecturally visible state. A 

processor may use a hint value to improve the effectiveness of the prefetch action. 
The defined hint values are shown in Table 17-17. 
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PREF 


Prefetch Pp R E F 


(continued) 


Table 17-17 Values of Hint Field for PREF Instruction 
Value Name Data Use and Desired Prefetch Action 
0 feud Data is expected to be loaded (not modified). 
Sl Fetch data as if for a load. 
Data is expected to be stored or modified. 
1 store : 
Fetch data as if for a store. 
2-3 Reserved 
Data is expected to be loaded (not modified) but not reused 
4 ae ere extensively; it “streams” through the cache. 
- Fetch data as if for a load and place it in the cache so that it does 
not displace data prefetched as “retained.” 
Data is expected to be stored or modified but not reused 
5 Sis hana extensively; it “streams” through the cache. 
7 Fetch data as if for a store and place it in the cache so that it does 
not displace data prefetched as “retained.” 
Data is expected to be loaded (not modified) and reused 
6 Ras gered extensively; it should be “retained” in the cache. 
> Fetch data as if for a load and place it in the cache so that it is 
not displaced by data prefetched as “streamed.” 
Data is expected to be stored or modified and reused 
7 Dotes apenesy extensively; it should be “retained” in the cache. 
7 Fetch data as if for a store and place it in the cache so that it is 
not displaced by data prefetched as “streamed.” 
8-24 Reserved 
25 writeback_invalidate 
26-31 Reserved 
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PREF sens PREF 


(continued) 


PREF never generates a memory operation for a location with an uncached 
memory access type. 


Prefetch cannot prefetch data from a mapped location unless the translation for 
that location is present in the TLB. Locations in memory pages that have not been 
accessed recently may not have translations in the TLB, so prefetch may not be 
effective for such locations. 


Prefetch does not cause addressing exceptions. It does not cause an exception to 
prefetch using an address pointer value before the validity of a pointer is 
determined. 


Operation: 


vAddr < GPRibase] + sign_extend(offset) 
(pAddr, CCA) < AddressTranslation(vAddr, DATA, LOAD) 
Prefetch(CCA, pAddr, vAddr, DATA, hint) 


Exceptions: 


Reserved Instruction exception 
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ROR Rotate Right ROR 


31 26 25 21 20 16 15 11 10 6 5 0 
SPECIAL 1 rt rd sa ROR 
000000 00001 000010 
6 5 5 5 5 6 

Format: 
ROR rd, rt, sa (VR5432 format) 
Description: 


The contents of general-purpose register rt are rotated right by sa bits. The result 
is stored in general-purpose register rd. 


Operation: 


32,64T: GPRird] — GPR[rt]sa-1..0 || GPRIrt]s1.sa 


Exceptions: 


None 
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RORV Rotate Right Variable RORV 


31 26 25 21 20 16 15 11 10 6 5 0 
SPECIAL rs rt rd 1 RORV 
000000 00001 000110 
6 5 5 5 5 6 

Format: 
RORY rd, rt, rs (VR5432 format) 
Description: 


The contents of general-purpose register rf are rotated right by the number of bits 
specified by the low-order five bits of general-purpose register rs. The result is 
stored in general-purpose register rd. 


Operation: 


32,64T: s<GPRIrs]q o 
GPR[rd] <— GPR[ri]s-1__.0 || GPR[rt}31..s 


Exceptions: 


None 
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S B Store Byte S B 


31 26 25 21 20 16 15 0 
SB base rt offset 
101000 
6 5 5 16 
Format: 
SB rt, offset (base) (MIPS I format) 
Description: 


The 16-bit offset is sign extended and added to the contents of general-purpose 
register base to form a virtual address. The least-significant byte of register rt is 
stored at the memory location specified by the address. 


Operation: 


32 T:  vAddr < ((offset;s)'® || offsetys 0) + GPR[base] 
(pAddr, uncached) < AddressTranslation (vAddr, DATA) 
pAddr <— pAddrpgize-1...3 || (DAddre._ 9 xor ReverseEndian’®) 


byte < vAddry_ xor BigEndianCPU? 

data — GPR[rtles_s+byte...0 || 08 >" 

StoreMemory (uncached, BYTE, data, pAddr, vAddr, DATA) 
64 T:  vAddr < ((offsetss)*? || offsetis._0) + GPR[base] 

(pAddr, uncached) < AddressTranslation (vAddr, DATA) 

pAddr <— pAddrpgiz_-1...3 || (PAddro. 9 xor ReverseEndian’*) 

byte < vAddro 9 xor BigEndianCPU® 

data — GPRIrtles_g+byte...0 || 08 >" 

StoreMemory (uncached, BYTE, data, pAddr, vAddr, DATA) 


Exceptions: 


TLB Miss exception 
TLB Invalid exception 
TLB Modified exception 
Bus Error exception 
Address Error exception 
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Store Conditional SC 


31 


26 25 21 20 16 15 0 


base rt offset 


5 5 16 


Format 


SC rt, offset (base) (MIPS II format) 


Description: 


The 16-bit offset is sign extended and added to the contents of general-purpose 
register base to form a virtual address. The contents of general-purpose register rt 
are stored at the memory location specified by the address only when the LL bit is 
set. If another processor or device changes the physical address after the previous 
LL instruction has been executed, or if the ERET instruction exists between the 
LL and SC instructions, the register contents are not stored to memory, and storing 
fails. 


This instruction is provided for compatibility with MIPS implementations that 
implement multiprocessing facilities. The VR5432 does not implement these 
facilities. 


The success or failure of the SC operation is indicated by the contents of general- 
purpose register rt after execution of the instruction. A successful SC instruction 
sets the contents of general-purpose register 7¢ to 1; an unsuccessful SC instruction 
sets them to 0. 


The operation of SC is undefined when the address is different from the address 
used in the last LL instruction. 


This instruction is available in User mode; it is not necessary for CPO to be 
enabled. 


If either of the low-order two bits of the address is not zero, an Address Error 
exception takes place. 


If this instruction both fails and causes an exception, the exception takes 
precedence. 
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Store Conditional 
(continued) SC 


Operation: 


32 


64 


T: 


vAddr < ((offsetys)'® || offsetys 0) + GPR[base] 
(pAddr, uncached) < AddressTranslation (vAddr, DATA) 
data <— GPRIrt]31_0 
if LLbit then 
StoreMemory (uncached, WORD, data, pAddr, vAddr, DATA) 
endif 
GPRIirt] < 03" || LLbit 


vAddr < ((offset;s)** || offset;5 0) + GPR[base] 
(pAddr, uncached) < AddressTranslation (vAddr, DATA) 
data <— GPRIrt]31_0 
if LLbit then 
StoreMemory (uncached, WORD, data, pAddr, vAddr, DATA) 
endif 
GPRIirt] — 0° || LLbit 
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Exceptions: 


TLB Miss exception 
TLB Invalid exception 
TLB Modified exception 
Bus Error exception 
Address Error exception 
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S C D Store Conditional Doubleword SC D 
31 26 25 21 20 16 15 0 
CD base rt offset 
111100 

6 5 5 16 

Format: 

SCD rt, offset (base) (MIPS III format) 
Description: 


The 16-bit offset is sign extended and added to the contents of general-purpose 
register base to form a virtual address. The contents of general-purpose register rt 
are stored at the memory location specified by the address only when the LL bit is 
set. If another processor or device changes the target address after the previous 
LLD instruction has been executed, or if the ERET instruction exists between the 
LLD and SCD instructions, the register contents are not stored to memory, and 
storing fails. 


This instruction is provided for compatibility with MIPS implementations that 
implement multiprocessing facilities. The VR5432 does not implement these 
facilities. 


The success or failure of the SCD operation is indicated by the contents of general- 
purpose register rt after execution of the instruction. A successful SCD instruction 
sets the contents of general-purpose register rt to 1; an unsuccessful SCD 
instruction sets them to 0. 


The operation of SCD is undefined when the address is different from the address 
used in the last LLD. 


This instruction is available in User mode; it is not necessary for CPO to be 
enabled. 


If any of the low-order three bits of the address is not zero, an Address Error 
exception takes place. If this instruction both fails and causes an exception, the 
exception takes precedence. 


This instruction is defined in 64-bit mode and 32-bit Kernel mode. If this 
instruction is executed in the 32-bit User or Supervisor mode, the Reserved 
Instruction exception occurs. 
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Store Conditional Doubleword 
SC D (continued) SC D 


Operation: 


64 TT:  vAddr < ((offset,s)** || offsetys 0) + GPR[base] 
(pAddr, uncached) < AddressTranslation (vAddr, DATA) 
data < GPRirt] 
if LLbit then 
StoreMemory (uncached, DOUBLEWORD, data, pAddr, vAddr, DATA) 
endif 
GPR{[rt] — 0° || LLbit 


Note: In the 32-bit Kernel mode, the high-order 32 bits are ignored during 
virtual address creation. 


Exceptions: 


TLB Miss exception 

TLB Invalid exception 

TLB Modified exception 

Bus Error exception 

Address Error exception 
Reserved Instruction exception 


508 Vr5432 Microprocessor User’s Manual 


CPU Instruction Set 


S D Store Doubleword S D 
31 26 25 21 20 16 15 0 
SD base rt offset 
111111 
6 5 5 16 
Format: 
SD rt, offset (base) (MIPS III format) 
Description: 
The 16-bit offset is sign extended and added to the contents of general-purpose 
register base to form a virtual address. The contents of general-purpose register rt 
are stored at the memory location specified by the address. 
If any of the low-order three bits of the address are not zero, an Address Error 
exception occurs. 
This operation is defined in 64-bit mode and 32-bit Kernel mode. Execution of this 
instruction in 32-bit User or Supervisor mode causes a Reserved Instruction 
exception. 
Operation: 
32. T:  vAddr < ((offset;s)'® || offsetys 0) + GPR[base] 
(pAddr, uncached) < AddressTranslation (vAddr, DATA) 
data <— GPR[rt] 
StoreMemory (uncached, DOUBLEWORD, data, pAddr, vAddr, DATA) 
64 TT:  vAddr < ((offset;s)*® || offset,s_9) + GPR[base] 
(pAddr, uncached) < AddressTranslation (vAddr, DATA) 
data < GPRirt] 
StoreMemory (uncached, DOUBLEWORD, data, pAddr, vAddr, DATA) 


Note: In 32-bit Kernel mode, the high-order 32 bits are ignored during 
virtual address creation. 
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SD 


510 


Store Doubleword 
(continued) 


Exceptions: 


TLB Miss exception 

TLB Invalid exception 

TLB Modified exception 

Bus Error exception 

Address Error exception 
Reserved Instruction exception 


SD 
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SDCz Store Doubleword SDCz 


from Coprocessor z 


31 26 25 21 20 16 15 0 
SDCz base rt offset 
1111xx’* 
6 5 5 16 


Format: 


SDCz rt, offset (base) (MIPS II format) 


Description: 


The 16-bit offset is sign extended and added to the contents of general-purpose 
register base to form a virtual address. Register rt of coprocessor unit z sources a 
doubleword, which the processor writes to the addressed memory location. The 
stored data is defined by individual coprocessor specifications. 


If any of the low-order three bits of the address is not zero, an Address Error 
exception takes place. 


This instruction is not valid for use with CPO. 


When CP1 is specified, the FR bit of the Status register equals 0 and the least- 
significant bit in the rt field is not 0, the operation of this instruction is undefined. 
If the FR bit equals 1, both odd and even registers can be specified by rt. 
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Store Doubleword 
S D Cz from Coprocessor z S D Cz 


(continued) 


Operation: 


32 T: vAddr< ((offsetys)'® || offsetys._ 9) + GPR[base] 
(pAddr, uncached) < AddressTranslation (vAddr, DATA) 
data < GPRi(rt), 
StoreMemory (uncached, DOUBLEWORD, data, pAddr, vAddr, DATA) 


64. T: vAddr < ((offset;s)*® || offset;s__0) + GPR[base] 
(pAddr, uncached) < AddressTranslation (vAddr, DATA) 
data < GPR(rt), 
StoreMemory (uncached, DOUBLEWORD, data, pAddr, vAddr, DATA) 


Exceptions: 


TLB Miss exception 

TLB Invalid exception 

TLB Modified exception 

Bus Error exception 

Address Error exception 
Coprocessor Unusable exception 


Opcode Bit Encoding: 
SDCz Bit#31 30 29 28 27 26 0 
SDC1 
Bit#31 30 29 28 27 26 0 
SDC2 
Opcode '— Coprocessor Number 
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Store Doubleword Left S D L 


31 26 25 21 20 16 15 0 
SDL base rt offset 
101100 
6 5 5 16 
Format: 
SDL rt, offset (base) (MIPS IT format) 
Description: 


This instruction is used in combination with the SDR instruction to store the 
doubleword data in the register to the doubleword in the memory that is not at the 
doubleword boundary. The SDL instruction stores the higher portion of the data 
to the memory, while the SDR instruction stores the lower portion. 


The 16-bit offset is sign extended and added to the contents of general-purpose 
register base to generate a virtual address. Of the doubleword data in the memory 
where the most-significant byte is specified by the generated address, only the 
lower portion of general-purpose register 7t is stored to memory at the same 
doubleword boundary as the target address. Depending on the address specified, 
the number of bytes to be loaded changes from one to eight. 


In other words, first the most-significant byte position of general-purpose register 
rt is stored to the bytes in the addressed memory. If there is data of the low-order 
byte that follows the same doubleword boundary, the operation to store this data 
to the next byte of the memory is repeated. 


Memory 
(Big Endian) 


address 8 | 8 


Register 
9 |10/11|12 /13 | 14]15 | Before 


o 


address 0 | 0 


1/2131/41/516]| 7 \storing |A|B|C/D| E|F|/ GH) $24 


SDL $24,1($0) 


address 8 | 8 | 9 | 10/ 11/12] 13) 14/15 | After 


address 0 | 0 


A\B|C/DI/E_ F| G'Storing 
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Store Doubleword Left 
S D L (continued) S D L 


The Address Error exception does not occur, even if the specified address is not 
located at the doubleword boundary. This operation is defined in the 64-bit mode 
and 32-bit Kernel mode. If this instruction is executed in the 32-bit User or 
Supervisor mode, the Reserved Instruction exception occurs. 


Operation: 


64 TT:  vAddr < ((offset;s)** || offset 45 0) + GPR[base] 
(pAddr, uncached) <— AddressTranslation 
(vAddr, DATA) 
pAddr <— pAddrpsize -1...3 || (pAddr. 9 xor ReverseEndian’®) 
If BigEndianMem = 0 then 
pAddr < pAddr31..3 || 0° 
endif 
byte — vAddro . g xor BigEndianCPU® 
data — 0°F 8PY* || GPRIrtles,_.56-8*byte 
Storememory (uncached, byte, data, pAddr, vAddr, DATA) 


Note: In 32-bit Kernel mode, the high-order 32 bits are ignored during 
virtual address creation. 
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S D L (continued) S D L 


The relationships between the addresses given to the SDL instruction and the 
result (bytes for doublewords in the memory) are shown below: 


SDL 
Register | A B C D E F G H 
Memory | J K L M N O P 
BigEndianCPU = 0 BigEndianCPU = 1 
Offset Offset 
vAddro 9 Destination Type| LEM BEM Destination ype) LEM/BEM 
0 | J KLMNOA| O 0/7 |ABCDEFGH] 7 0 0 
1 I J KLMNAB| 1 0/6 |! ABCDEF G| 6 0 1 
2 | J KLMABC}] 2 0/5 Il! J ABCDE FY] 5 0} 2 
3 | J KLABCD] 3 0;4 || J KABCDE] 4 0/3 
4 | JKABCDE| 4 0/3 IJ KLABC DI] 3 0/4 
5 | JABCDEF] 5 0/2 1 J KLMABC] 2 0 5 
6 | ABCDEFG| 6 0 | 1 1 J KLMNA By 1 0 6 
7 ABCDEFGH| 7 0/0 1 J KLMNOAY 0 0 7 
Note: Type: Access type output to memory (refer to Table 16-3 on 
page 324 for information on byte access within a double- 
word) 
Offset: pAddr» _ , Output to memory 
LEM Little-endian memory (BigEndianMem = 0) 
BEM Big-endian memory (BigEndianMem = 1) 
Exceptions: 


TLB Miss exception 

TLB Invalid exception 

TLB Modified exception 

Bus Error exception 

Reserved Instruction exception 
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SDR 


Store Doubleword Right S D R 


31 26 25 21 20 16 15 0 
SDR base rt offset 
101101 
6 5 5 16 
Format: 
SDR rt, offset (base) (MIPS III format) 
Description: 


This instruction is used in combination with the SDL instruction to store the 
doubleword data in the register to the word data in the memory that is not at the 
doubleword boundary. The SDL instruction stores the higher portion of the data 
to the memory, while the SDR instruction stores the lower portion. 


The 16-bit offset is sign extended and added to the contents of general-purpose 
register base to generate a virtual address. Of the doubleword data in the memory 
where the least-significant byte is specified by the generated address, only the 
lower portion of general-purpose register rt is stored to memory at the same 
doubleword boundary as the target address. Depending on the address specified, 
the number of bytes to be loaded changes from | to 8. 


In other words, first the least-significant byte position of general-purpose register 
rt is stored to the bytes in the addressed memory. If there is data of the high-order 
byte that follows the same doubleword boundary, the operation to store this data 
to the next byte of the memory is repeated. 


[Memory SY 
(Big Endian) Register 
address 8 | g | 9 | 10/11|12,13/14|15] Before 
cde De Chole aoe eh coe B|c|D/E|F| G]H] $24 
SDR $24,10($0) 
address8 | 8 | 9 | 10/ 11/12/13 14/15 pris 
addressO [E|F |G Alege 
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Store Doubleword Right 


(continued) S D R 


The Address Error exception does not occur, even if the specified address is not 
located at the doubleword boundary. This operation is defined in the 64-bit mode 
and 32-bit Kernel mode. If this instruction is executed in the 32-bit User or 
Supervisor mode, the Reserved Instruction exception occurs. 


Operation: 


vAddr < ((offsetys)“° || offset 45.0) + GPR[base] 

(pAddr, uncached) < AddressTranslation (vAddr, DATA 

pAddr <— pAddrpsize - 1...3 || (pDAddre._ 9 xor ReverseEndian®) 

If BigEndianMem = 0 then 

pAddr — pAddrpgize - 1... || 0° 

endif 

byte <— vAddro_ 9 xor BigEndianCPU® 

data < GPRItles_¢*byte || 08 >" 

StoreMemory (uncached, DOUBLEWORD-byte, data, pAddr, vAddr, 
DATA) 


Note: In 32-bit Kernel mode, the high-order 32 bits are ignored during 
virtual address creation. 
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Store Doubleword Right 
S D R (continued) S D R 


The relationships between the addresses given to the SDR instruction and the 


result (bytes for doublewords in the memory) are shown below: 


SDR 
Register | A B Cc D E F G H 
Memory | J K L MsiN O P 
BigEndianCPU = 0 BigEndianCPU = 1 
Offset Offset 
vAddro 0 Destination |Type|LEM/BEM Destination Type LEM |BEM 
0 ABCDEFGH| 7 0/0 ;HJ KL MNOP| O 7 | 0 
1 BCDEFGHP| 6 1/0 |GHKLMNOP|} 1 6 | 0 
2 CDEFGHOP| 5 210 |FGHLMNOP} 2 5 | 0 
3 DEF GHNOP| 4 3/0 |EF GHMNOP} 3 4/0 
4 EF GHMNOP) 3 41/0 |DEFGHNOP|} 4 3 / 0 
5 F GHLMNOP) 2 5/0 ;CDEFGHOP| 5 2) 0 
6 GHKLMNOP| 1 6/0 ;,BCDEFGH P| 6 1 0 
7 HJ KLMNOP| O 7/0 |ABCDEFGH| 7 0 | 0 
Note: Type: Access type output to memory (refer to Table 16-3 on 


page 324 for information on byte access within a double- 


word) 


Offset. pAddr , Output to memory 
LEM Little-endian memory (BigEndianMem = 0) 
BEM Big-endian memory (BigEndianMem = 1) 


Exceptions: 


TLB Miss exception 
TLB Invalid exception 
TLB Modified exception 
Bus Error exception 


Reserved Instruction exception 
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SH 


Store Halfword S I+ 


31 26 25 21 20 16 15 0 
SH base rt offset 
101001 
6 5 5 16 
Format: 
SH rt, offset (base) (MIPS I format) 
Description: 


The 16-bit offset is sign extended and added to the contents of general-purpose 
register base to form a virtual address. The least-significant halfword of register 
rt is stored in the memory specified by the address. 


If the least-significant bit of the address is not zero, an Address Error exception 
occurs. 


Operation: 


32 ys 


64 T: 


vAddr < ((offset;s)'® || offsetys ) + GPR[base] 

(pAddr, uncached) < AddressTranslation (vAddr, DATA) 
pAddr < pAddrpgize-1...3 || (DAddre. 9 xor (ReverseEndian? || 0)) 
byte < vAddro_ xor (BigEndianCPU? || 0) 
data — GPR[rtles_s+pyte..o || 0°" 

StoreMemory (uncached, HALFWORD, data, pAddr, vAddr, DATA) 
vAddr < ((offset;s)*° || offset,;) + GPR[base] 

(pAddr, uncached) < AddressTranslation (vAddr, DATA) 
pAddr <— pAddrpgiz_-1...3 || (DAddro. 9 xor (ReverseEndian? || 0)) 
byte < vAddro_ 9 xor (BigEndianCPU? || 0) 
data <— GPR[rt]g3_s+byte...0 || 0° PY 
StoreMemory (uncached, HALFWORD, data, pAddr, vAddr, DATA) 
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S H Store Halfword S H 


(continued) 


Exceptions: 


TLB Miss exception 
TLB Invalid exception 
TLB Modified exception 
Bus Error exception 
Address Error exception 
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CPU Instruction Set 


Shift Left Logical S L L 


31 26 25 21 20 16 15 11 10 6 5 0 
SPECIAL 0 rt rd sa SLL 
000000 00000 000000 
6 5 5 5 5 6 
Format: 
SLL rd, rt, sa (MIPS I format) 
Description: 
The contents of general-purpose register rt are shifted left by sa bits, inserting 
zeros into the low-order bits. The result is stored in general-purpose register rd. In 
the 64-bit mode, the value resulting from sign-extending the shifted 32-bit value 
is stored as a result. If the shift value is 0, the low-order 32 bits of the 64-bit value 
are sign extended. This instruction can generate a 64-bit value that sign-extends a 
32-bit value. 
Operation: 
32 T: GPR[rd] — GPR[rt]34_ sa. || 08? 
64 T: s¢O|lsa 
temp <— GPRIrt]31-s...0 || 0° 
GPR[rd] < (temp31)°? || temp 
Exceptions: 


None 


Caution: If the shift value of this instruction is 0, the assembler may treat 
this instruction as an NOP. When using this instruction for sign 
extension, check the specifications of the assembler. 
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S LLV Shift Left Logical Variable S LLV 


31 26 25 21 20 16 15 11 10 6 5 0 
SPECIAL rs rt rd 0 SLLV 
000000 00000 000100 
6 5 5 5 5 6 
Format: 
SLLV rd, rt, rs (MIPS I format) 
Description: 


The contents of general-purpose register rt are shifted left the number of bits 
specified by the low-order five bits of general-purpose register rs, inserting zeros 
into the low-order bits. The result is stored in general-purpose register rd. In the 
64-bit mode, the value resulting from sign-extending the shifted 32-bit value is 
stored as a result. If the shift value is 0, the low-order 32 bits of the 64-bit value 
are sign extended. This instruction can generate a 64-bit value that sign-extends a 
32-bit value. 


Operation: 


32 T: s¢<GPRIrs]4. 9 
GPR[rd]<— GPR[rt]:31-s)...0 || 0° 
64 T: s€0O]|| GPRI[rs]q 9 
temp <— GPR[rt](31-s)...0 || 0° 
GPR[rd] <— (temp31)°" || temp 


Exceptions: 


None 


Caution: If the shift value of this instruction is 0, the assembler may treat 
this instruction as an NOP. When using this instruction for sign 
extension, check the specifications of the assembler. 
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CPU Instruction Set 


Set On Less Than S LT 


31 26 25 21 20 16 15 11.10 6 5 0 
SPECIAL rs rt rd 0 SLT 
000000 00000 101010 

6 5 5 5 5 6 


Format: 
SLT rd, rs, rt (MIPS I format) 


Description: 


The contents of general-purpose register rt are subtracted from the contents of 
general-purpose register rs. Interpreting these values as signed integers, if the 
contents of general-purpose register rs are less than the contents of general- 
purpose register rf, one is stored in the general-purpose register rd; otherwise, zero 
is stored in general-purpose register rd. 


An Integer Overflow exception never occurs. The comparison is valid even if the 
subtraction used during the comparison overflows. 


Operation: 
32 =T: if GPR[rs] < GPR[rt] then 
GPR[rd] — 0°" || 1 
else 
GPR{rd] — 0°? 
endif 
64 TT: — if GPR[rs] < GPR[rt] then 
GPR[rd] < 0° || 1 
else 
GPR{[rd] — 04 
endif 
Exceptions: 
None 
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S LTI Set On Less Than Immediate S LTl 


31 26 25 21 20 16 15 0 
SLTI rs rt immediate 
001010 
6 5 5 16 
Format: 
SLTI rt, rs, immediate (MIPS I format) 
Description: 


The 16-bit immediate is sign extended and subtracted from the contents of 
general-purpose register rs. Interpreting these values as signed integers, if rs 
contents are less than the sign-extended immediate, one is stored in general- 
purpose register rt; otherwise, zero is stored in the general-purpose register rt. 


An Integer Overflow exception never occurs. The comparison is valid even if the 
subtraction overflows. 


Operation: 


32 T: if GPR[rs] < (immediate,s)'® || immediate;s5 9 then 
GPR[rt] — 03" || 1 
else 
GPRIrt] — 0°? 
endif 


64 TT: if GPR[rs] < (immediate;s)** || immediate;5 . then 
GPRI[rt] — 08 || 1 
else 
GPRIrt] — 0° 
endif 


Exceptions: 


None 
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Set On Less Than 
Immediate Unsigned 


SLTIU 


31 26 25 21 20 16 15 0 
SLTIU rs rt immediate 
001011 
6 5 5 16 
Format: 
SLTIU rt, rs, immediate (MIPS I format) 
Description: 
The 16-bit immediate is sign extended and subtracted from the contents of 
general-purpose register rs. Interpreting these values as unsigned integers, if rs 
contents are less than the sign-extended immediate, one is stored in the general- 
purpose register rt; otherwise zero is stored in the general-purpose register 7t. 
An Integer Overflow exception never occurs. The comparison is valid even if the 
subtraction overflows. 
Operation: 
32 TT: if(0|| GPR[rs]) < (immediate ;5)'® || immediate;s 9 then 
GPRi[rt] < 0°" || 1 
else 
GPRIrt] — 0°? 
endif 
64 TT: if (0 || GPR[rs]) < (immediate ;5)*° || immediate;s 9 then 
GPR[rt] — 08 || 1 
else 
GPRi[rt] — 0° 
endif 
Exceptions: 
None 
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SLTU 


Set On Less Than Unsigned SLTU 


31 26 25 21 20 16 15 11 10 6 5 0 
SPECIAL rs rt rd 0 SLTU 
000000 00000 101011 
6 5 5 5 5 6 

Format: 
SLTU rd, rs, rt (MIPS I format) 
Description: 


The contents of general-purpose register rt are subtracted from the contents of 
general-purpose register rs. Interpreting these values as unsigned integers, if the 
contents of general-purpose register rs are less than the contents of general- 
purpose register rt, one is stored in general-purpose register rd; otherwise, zero is 
stored in the general-purpose register rd. 


An Integer Overflow exception never occurs. The comparison is valid even if the 
subtraction overflows. 


Operation: 
32 TT: if (0 || GPR[rs]) <0 || GPR[rt] then 
GPR[rd] < 0°! || 1 
else 
GPRird] — 092 
endif 
64 TT: if (0 || GPR[rs]) <0 || GPR[rt] then 
GPRI[rd] <— 0° || 1 
else 
GPRird] — 04 
endif 
Exceptions: 
None 
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SRA Shift Right Arithmetic SRA 


31 26 25 21 20 16 15 11 10 6 5 0 
SPECIAL 0 rt rd sa SRA 
000000 00000 000011 
6 5 5 5 5 6 

Format: 
SRA rd, rt, sa (MIPS I format) 
Description: 


The contents of general-purpose register rt are shifted right by sa bits, inserting 
signed bits into the high-order bits. The result is stored in general-purpose register 
rd. In 64-bit mode, the sign-extended 32-bit value is stored as the result. 


Operation: 


32 T: GPR[rd] — (GPR[rt]3;)$* || GPR[rt] 34. sa 


64 T: s¢Ol|lsa 
temp <— (GPR[rt]31)° || GPRIrt] 31...s 
GPR[rd] <— (temp31)°" || temp 


Exceptions: 


None 
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Shift Right 
S RAV Arithmetic Variable S RAV 
31 26 25 21 20 16 15 11 10 6 5 0 
SPECIAL rs rt rd 0 SRAV 
000000 00000 000111 
6 5 5 5 5 6 
Format: 
SRAV rd, rt, rs (MIPS I format) 

Description: 


The contents of general-purpose register rt are shifted right by the number of bits 
specified by the low-order five bits of general-purpose register rs, sign-extending 
the high-order bits. The result is stored in the general-purpose register rd. In 64- 
bit mode, the sign-extended 32-bit value is stored as the result. 


Operation: 


32 T: s<GPRirs]q. 0 
GPR[rd] — (GPR[rt]31)° || GPR[rt]31._.s 


64 T: s«GPRirs]4 0 
temp <— (GPR[rt]31)° || GPR[rtl31__.s 
GPR[rd] <— (temp31)°" || temp 


Exceptions: 


None 
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SRL Shift Right Logical SR L 


31 26 25 21 20 16 15 11 10 6 5 0 
SPECIAL 0 rt rd sa SRL 
000000 00000 000010 
6 5 5 5 5 6 

Format: 
SRL rd, rt, sa (MIPS I format) 
Description: 


The contents of general-purpose register rt are shifted right by sa bits, inserting 
zeros into the high-order bits. The result is stored in general-purpose register rd. 
In 64-bit mode, the sign-extended 32-bit value is stored as the result. 


Operation: 


32. T: GPRi[rd] << 0 §*|| GPR[rt]s;_ sa 


64 T: s€0Ol|lsa 
temp < 0° || GPR[rt]s1.__s 
GPR[rd] <— (temp3;)°" || temp 


Exceptions: 


None 
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SRLV 


Shift Right Logical Variable S R LV 


31 26 25 21 20 16 15 11 10 6 5 0 
SPECIAL rs rt rd 0 SRLV 
000000 00000 000110 
6 5 5 5 5 6 

Format: 
SRLV rd, rt, rs (MIPS I format) 
Description: 


The contents of general-purpose register rt are shifted right by the number of bits 
specified by the low-order five bits of general-purpose register rs, inserting zeros 
into the high-order bits. The result is stored in general-purpose register rd. In 64- 
bit mode, the sign-extended 32-bit value is stored as the result. 


Operation: 


32 T: 


s< GPRIrs]4_0 
GPR[rd] < 0° || GPR{[rt]31.. 


64 T: s<GPRIrs]q. 0 
temp < 0° || GPR[rt]s1.. 
GPR[rd] < (temp31)°* || temp 
Exceptions: 
None 
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S U B Subtract S U B 


31 26 25 21 20 16 15 11 10 6 5 0 
SPECIAL rs rt rd 0 SUB 
000000 00000 100010 
6 5 5 5 5 6 

Format: 
SUB rd, rs, rt (MIPS I format) 
Description: 


The contents of general-purpose register rt are subtracted from the contents of 
general-purpose register rs. The result is stored into general-purpose register rd. 
In 64-bit mode, the sign-extended 32-bit value is stored as the result. 


An Integer Overflow exception occurs if the carries-out of bits 30 and 31 differ (a 
two’s-complement overflow). The destination register rd is not modified when an 
Integer Overflow exception occurs. 


Operation: 


32. T: GPR[rd] < GPRirs] — GPRI[rt] 


64 T: temp < GPR[rs] — GPR[rt] 
GPR[rd] < (temp31)°? || tempsi._o 


Exceptions: 


Integer Overflow exception 
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S U B U Subtract Unsigned S U B U 


31 26 25 21 20 16 15 11 10 6 5 0 
SPECIAL rs rt rd 0 SUBU 
000000 00000 100011 

6 5 5 5 5 6 
Format: 
SUBU rd, rs, rt (MIPS I format) 
Description: 


The contents of general-purpose register rt are subtracted from the contents of 
general-purpose register rs. The result is stored in general-purpose register rd. In 
64-bit mode, the sign-extended 32-bit value is stored as the result. 


The only difference between this instruction and the SUB instruction is that SUBU 
never causes an Integer Overflow Exception. 


Operation: 


32 T: GPRird] < GPR[rs] — GPRIrt] 


64 T: temp < GPR[rs] — GPR[rt] 
GPR[rd] < (temp31)°? || tempsi..o 


Exceptions: 


None 
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SW Store Word SW 
31 26 25 21 20 16 15 0 
SW base rt offset 
101011 
6 5 5 16 
Format: 


SW tt, offset (base) 


Description: 


(MIPS I format) 


The 16-bit offset is sign extended and added to the contents of general-purpose 
register base to form a virtual address. The contents of general-purpose register rt 
are stored in the memory location specified by the address. If either of the low- 
order two bits of the address is not zero, an Address Error exception occurs. 


Operation: 
32—COTT: 
64 


T: vAddr < ((offset;s)*® || offset;s__ 9) + GPR[base] 
(pAddr, uncached) < AddressTranslation (vAddr, DATA) 
data <— GPR[rt]31..0 
StoreMemory (uncached, WORD, data, pAddr, vAddr, DATA) 


vAddr < ((offset;s) ‘© || offsetis._ 9) + GPR[base] 

(pAddr, uncached) < AddressTranslation (vAddr, DATA) 
data — GPRirt]31..6 

StoreMemory (uncached, WORD, data, pAddr, vAddr, DATA) 


Exceptions: 


TLB Miss exception 


TLB Invalid exception 


TLB Modified exception 
Bus Error exception 
Address Error exception 


Vr5432 Microprocessor User’s Manual 


533 


Chapter 17 


SWCz 


Store Word from Coprocessor z SWC 


Z 


31 26 25 21 20 16 15 0 
SWCz base rt offset 
1110xx* 
6 5 5 16 
Format: 
SWCz rt, offset (base) (MIPS I format) 
Description: 


The 16-bit offset is sign extended and added to the contents of general-purpose 
register base to form a virtual address. Coprocessor register rt of the CPz is stored 


in the addressed memory. The data to be stored is defined by individual 
coprocessor specifications. This instruction is not valid for use with CPO. 


If either of the low-order two bits of the address is not zero, an Address Error 
exception occurs. 


Operation: 


32 


64 


T: vAddr < ((offset;s)'®!! offsety5 0) + GPR[base] 
(pAddr, uncached) < AddressTranslation (vAddr, DATA) 
pAddr <— pAddrpgize-1...3 || (pDAddre. 9 xor (ReverseEndian || 0°)) 
byte < vAddro_ 9 xor (BigEndianCPU || 07) 
data < COPzSW (byte, rt) 
StoreMemory (uncached, WORD, data, pAddr, vAddr, DATA) 


T: vAddr < ((offsetys)*® || offsety5 9) + GPR[base] 
(pAddr, uncached) < AddressTranslation (vAddr, DATA) 
pAddr <— pAddrpgize-1...3 || (DAddre. 9 xor (ReverseEndian || 0°)) 
byte < vAddro_ g xor (BigEndianCPU || 07) 
data < COPzSW (byte,rt) 
StoreMemory (uncached, WORD, data, pAddr, vAddr DATA) 
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Exceptions: 


Store Word from Coprocessor z 


TLB Miss exception 
TLB Invalid exception 
TLB Modified exception 
Bus Error exception 
Address Error exception 
Coprocessor Unusable exception 


(continued) 


CPU Instruction Set 


SWCz 


Opcode Bit Encoding: 
SWCz Bit#31 30 29 28 27 26 0 
SWC1 
Bit#31 30 29 28 27 26 0 
SWC2 
Opcode '— Coprocessor Number 
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SWL 


Store Word Left SWL 


31 26 25 21 20 16 15 0 
SWL base rt offset 
101010 
6 5 5 16 
Format: 
SWL rt, offset (base) (MIPS I format) 
Description: 


This instruction is used in combination with the SWR instruction to store a word 
in a register to a word in memory that is not at the word boundary. The SWL 
instruction stores the higher portion of the data to memory, while the SWR 
instruction stores the lower portion. 


The 16-bit offset is sign extended and added to the contents of general-purpose 
register base to generate a virtual address. Of the word data in the memory where 
the most-significant byte is specified by the generated address, only the higher 
portion of general-purpose register rt is stored to memory at the same word 
boundary as the target address. 


Depending on the address specified, the number of bytes to be stored changes 
from | to 4. 


In other words, first the most-significant byte position of general-purpose register 
rt is stored to the bytes in the addressed memory. If there is data of the low-order 
byte that follows the same word boundary, the operation to store this data to the 
next byte of the memory is repeated. 


No Address Error exceptions occur when the specified address is not located at the 
word boundary. 


Memory 
(Big Endian) Register 
address 4 4 5 6 7 | Before 
f A B C D 

addressO | O| 1 2 | 3 | storing | | $24 
address4 | 4] 5] 6 | 7 | After SWL$24,1($0) 

address 0 0 A B C | storing 
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SWL Store Word Left SWL 


(continued) 


Operation: 


32 T: vAddr < ((offset;s)'® || offset 45 0) + GPR[base] 
(pAddr, uncached) <- AddressTranslation (vAddr, DATA) 
pAddr <— pAddrpsiz_ -1...3 || (Pp Addro. 9 xor ReverseEndian®) 
If BigEndianMem = 0 then 
pAddr < pAddr31. 2 || 07 
endif 
byte < vAddr;_9 xor BigEndianCPU 
if (VAddro xor BigEndianCPU) = 0 then 
data <— 0° || 0°*"PY'° || GPRIrt]s1._24-8+byte 
else 
data <— 0°*°°PY!° || GPRIrt}g1...24-8*byte || 0°* 
endif 
Storememory (uncached, byte, data, pAddr, vAddr, DATA) 


64 T: vAddr < ((offsetys)*® || offset 45. 0) + GPR[base] 
(pAddr, uncached) < AddressTranslation (vAddr, DATA) 
pAddr <— pAddr31..3 || (pAddre. 9 xor ReverseEndian’) 
If BigEndianMem = 0 then 
pAddr < pAddr3;_ 9 || 07 
endif 
byte < vAddr,._9 xor BigEndianCPU2 
if (VAddro xor BigEndianCPU) = 0 then 
data <— 0° || 07*°FY'° || GPRIrt]s1...24-8%byte 
else 
data — 0°* 8Pyte || GPR[rt]31.. 24-8*byte || 0° 
endif 
StoreMemory (uncached, byte, data, pAddr, vAddr, DATA) 
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SWL 


Store Word Left 
(continued) 


SWL 


The relationships between the contents given to the SWL instruction and the result 
(bytes for words in the memory) are shown below: 


SWL 
Register A B C D E F G H 
Memory | J K L M N O P 
BigEndianCPU = 0 BigEndianCPU = 1 
Offset Offset 
vAddro_o Destination |Type|LEM BEM Destination Type| LEM |BEM 
0 | J KLMNOE| 0O 0/7 |EF GHMNOP) 3 4 | 0 
1 Il J KLMNEF] 1 0|6 1 EF GMNOP| 2 4 1 
2 | J KLMEFQG| 2 0 | 5 IJ EFMNOP| 1 4 | 2 
3 | J KLEFGH| 3 0/4 || J KEMNOP)| O 4 | 3 
4 | J KEMNOP| 0O 43 1 J KLEFGH} 3 0 | 4 
5 | J EFMNOP| 1 42 | J KL MEF G 2 0 5 
6 | EF GMNOP| 2 4 | 1 1 J KLMNE F} 1 0 6 
7 EF GHMNOP| 3 4 0 1 J KLMNOE;| O 0 7 
Note: Type: Access type output to memory (refer to Table 16-3 on 
page 324 for information on byte access within a double- 
word) 
Offset: pAddr , Output to memory 
LEM: Little-endian memory (BigEndianMem = 0) 
BEM: Big-endian memory (BigEndianMem = 1) 
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(continued) 


Exceptions: 


TLB Miss exception 

TLB Invalid exception 

Bus Error exception 

Address Error exception 
Reserved Instruction exception 
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SWR 


Store Word Right SWR 


31 26 25 21 20 16 15 0 
SWR base rt offset 
101110 
6 5 5 16 
Format: 
SWR tt, offset (base) (MIPS I format) 
Description: 


540 


This instruction is used in combination with the SWL instruction to store word 
data in a register to a word in memory that is not at the word boundary. The SWL 
instruction stores the higher portion of the data to memory, while the SWR 
instruction stores the lower portion. 


The 16-bit offset is sign extended and added to the contents of general-purpose 
register base to generate a virtual address. Of the word data in the memory where 
the least-significant byte is specified by the generated address, only the lower 
portion of general-purpose register rt is stored to memory at the same word 
boundary as the target address. Depending on the address specified, the number of 
bytes to be stored changes from | to 4. 


In other words, first the least-significant byte position of general-purpose register 
rt is stored to the bytes in the addressed memory. If there is data of the high-order 
byte that follows the same word boundary, the operation to store this data to the 
next byte of the memory is repeated. 
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Store Word Right SWR 


(Continued) 


No Address Error exceptions occur when the specified address is not located at the 
word boundary. 


address 4 
address 0 


address 4 
address 0 


Memory 
(Big Endian) Register 
4 5 6 7 | Before 
0 1 2 3 | storing s 2 c EAL et 
SWR $24,4($0) 
D| 5 6 7 | After 
0 { 2 3 storing 
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SWR 


Store Word Right SWR 


(continued) 


Operation: 


32 


64 


T: vAddr < ((offset;s)'® || offset 45 9) + GPR[base] 


(pAddr, uncached) < AddressTranslation (vAddr, DATA) 
pAddr < pAddrpgize _ 1...3 || (DAddro. 9 xor ReverseEndian’) 
BigEndianMem = 0 then 
pAddr < pAddr31. 2 || 07 
endif 
byte <— vAddr,_ xor BigEndianCPU? 
if (vAddr> xor BigEndianCPU) = 0 then 
data <— 0°? || GPR[rt}31-s*byte...0 || 08 ¥"* 
else 
data — GPRIrt]31.8:byte..0 || O° PM" || 0°? 
endif 
Storememory (uncached, WORD-byte, data, pAddr, vAddr, DATA) 


vAddr < ((offset;s)*° || offset 45 0) + GPR[base] 
(pAddr, uncached) < AddressTranslation (vAddr, DATA) 
pAddr <— pAddrpsize - 4...3 || (pAddr. 9 xor ReverseEndian’) 
If BigEndianMem = 0 then 
pAddr < pAddr31. 2 || OF 
endif 
byte < vAddr, 9 xor BigEndianCPU 
if (vAddr> xor BigEndianCPU) = 0 then 
data <— 0° || GPRItt]31-*byte...0 || 0°" 
else 
data — GPRIrt]s1.8:byte..0 || O° PM" || 0°? 
endif 
StoreMemory (uncached, WORD-byte, data, pAddr, vAddr, DATA) 
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(continued) 


The relationships between the register contents given to the SWR instruction and 
the result (bytes for words in the memory) are shown below: 


SWR 
Register A B Cc D E F G H 
Memory | J K L M|N O P 
BigEndianCPU = 0 BigEndianCPU = 1 
Offse Offset 
vAddro._o Destination Type |) em BEM Destination Type | EM BEM 
0 1 J KLEFGH} 3 0/4 HJ KLMNOP | 0 7 0 
1 1 J KLFGHP} 2 1 4 GHKLMNOP |} 1 6 0 
2 1 J KLGHOP/ 1 2|4 FGHLMNOP)| 2 5 0 
3 1 J KLHNOP}| O 3/4 EF GHMNOP)| 3 4 0 
4 EF GHMNOP} 3 4 |0 1 J KLHNOP|O 3 4 
5 F GHLMNOP} 2 5 10 lJ KLGHOP)| 1 2 4 
6 GHKLMNOP| 1 6 | 0 1 J KL FGHP) 2 1 4 
7 HJKLMNOP|} 0 7 10 1 J KLEFGH) 3 0 4 


Note: Type: Access type output to memory (refer to Table 16-3 on 
page 324 for information on byte access within a double- 
word) 


Offset. | pAddr , Output to memory 
LEM: Little-endian memory (BigEndianMem = 0) 
BEM: Big-endian memory (BigEndianMem = 1) 
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(continued) 


Exceptions: 


TLB Miss exception 
TLB Invalid exception 
Bus Error exception 
Address Error exception 
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SYNC Synchronize SYNC 


31 26 25 1110 6.5 0 
SPECIAL 0 stype SYNC 
000000 000000000000000 001111 
6 15 5 6 

Format: 
SYNC (MIPS II format) 
Description: 


This instruction is provided for compatibility with MIPS implementations that 
implement multiprocessing facilities. The VR5432 does not implement these 
facilities. This instruction executes as an NOP on the VR5432. 


Operation: 


32,64 T: SyncOperation () 


Exceptions: 


None 
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SYSCALL System Call 


SYSCALL 


31 26 25 6 5 0 
SPECIAL code SYSCALL 
000000 001100 
6 20 6 
Format: 
SYSCALL (MIPS I format) 
Description: 


A System Call exception occurs after this instruction is executed, unconditionally 


transferring control to the exception handler. 


The code field is available for transferring parameters to the exception handler. 
The parameter is retrieved by the exception handler only by loading as data the 
contents of the memory word containing the instruction. 


Operation: 


32,64T: SystemCallException 


Exceptions: 
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System Call exception 
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TEQ Trap If Equal TEQ 
31 26 25 21 20 16 15 5 0 
SPECIAL rs rt code TEQ 
000000 110100 
6 5 5 10 6 

Format: 
TEQtrs, rt (MIPS II format) 
Description: 


The contents of general-purpose register rt are compared with those of general- 
purpose register rs. If the contents of general-purpose register rs are equal to the 
contents of general-purpose register rt, a Trap exception occurs. 


The code field is available for transferring parameters to the exception handler. 
The parameter is retrieved by the exception handler only by loading as data the 


contents of the memory word containing the instruction. 


Operation: 


32, 64 T: if GPR[rs] = GPR[rt] then 
TrapException 
endif 


Exceptions: 


Trap exception 
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TEQI Trap If Equal Immediate TEQI 


31 26 25 21 20 16 15 0 
REGIMM rs TEQI immediate 
000001 01100 
6 5 5 16 
Format: 
TEQI rs, immediate (MIPS II format) 
Description: 


The 16-bit immediate is sign extended and compared with the contents of general- 
purpose register rs. If the contents of general-purpose register rs are equal to the 
sign-extended immediate, a Trap exception occurs. 


Operation: 


32—COSTT:: if GPR[rs] = (immediate,5)'© || immediate;5 9 then 
TrapException 
endif 


64 OT: if GPR[rs] = (immediate ,5)*° || immediate;5 9 then 
TrapException 
endif 


Exceptions: 


Trap exception 
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TG E Trap If Greater Than or Equal TG = 
31 26 25 21 20 16 15 5 0 
SPECIAL rs code TGE 
000000 110000 
6 5 10 6 

Format: 
TGE rs, rt (MIPS II format) 
Description: 


The contents of general-purpose register rt are compared with the contents of 
general-purpose register rs. Interpreting both register contents as signed integers, 
if the contents of general-purpose register rs are greater than or equal to the 
contents of general-purpose register rt, a Trap exception occurs. 


The code field is available for transferring parameters to the exception handler. 
The parameter is retrieved by the exception handler only by loading as data the 


contents of the memory word containing the instruction. 


Operation: 


endif 


32,64 T: if GPR[rs] = GPR[rt] then 
TrapException 


Exceptions: 


Trap exception 
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TG E| Trap If Greater Than or Equal Immediate TG | 
31 26 25 21 20 16 15 0 
REGIMM rs TGEl immediate 
000001 01000 
6 5 5 16 
Format: 
TGEI rs, immediate (MIPS II format) 
Description: 
The 16-bit immediate is sign extended and compared with the contents of general- 
purpose register rs. Interpreting both values as signed integers, if the contents of 
general-purpose register rs are greater than or equal to the sign-extended 
immediate, a Trap exception occurs. 
Operation: 
32 T: if GPR[rs] = (immediate;s)'® || immediate;5 9 then 
TrapException 
endif 
64 T: if GPR[rs] = (immediate;s)*° || immediate;5 9 then 
TrapException 
endif 
Exceptions: 
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Trap exception 
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Trap If Greater Than or Equal 


Immediate Unsigned TG El U 


31 26 25 21 20 16 15 0 
REGIMM rs TGEIU immediate 
000001 01001 
6 5 5 16 
Format: 
TGEIU rs, immediate (MIPS II format) 
Description: 


The 16-bit immediate is sign extended and compared with the contents of general- 
purpose register rs. Interpreting both values as unsigned integers, if the contents 
of general-purpose register rs are greater than or equal to the sign-extended 
immediate, a Trap exception occurs. 


Operation: 
32 T: if (0 || GPR[rs]) = (0 || (immediate;;)'® || immediate;5 9) then 
TrapException 
endif 
64 T: if (0 || GPR[rs]) = (0 || (immediate;;)*° || immediate;5 9) then 
TrapException 
endif 
Exceptions: 


Trap exception 
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TG E U Trap If Greater Than or Equal Unsigned TG E U 


31 26 25 21 20 16 15 6 5 0 
SPECIAL rs rt code TGEU 
000000 110001 
6 5 5 10 6 

Format: 
TGEU rs, rt (MIPS II format) 
Description: 


The contents of general-purpose register rt are compared with the contents of 
general-purpose register rs. Interpreting both values as unsigned integers, if the 
contents of general-purpose register rs are greater than or equal to the contents of 
general-purpose register rt, a Trap exception occurs. 


The code field is available for transferring parameters to the exception handler. 
The parameter is retrieved by the exception handler only by loading as data the 
contents of the memory word containing the instruction. 


Operation: 


32,64 TT: — if (0|| GPR[rs]) = (0 || GPR[rt]) then 
TrapException 
endif 


Exceptions: 


Trap exception 
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Probe TLB for Matching Entry 


TLBP 


Coprocessor Unusable exception 
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31 26 2524 65 0 
COPO CO 0 TLBP 
010000 1 0000000000000000000 001000 
6 1 19 6 
Format: 
TLBP (MIPS I format) 
Description: 
Searches for a TLB entry that matches with the contents of the EntryHI register 
and stores an index for that TLB entry in the Index register. If a TLB entry that 
matches is not found, sets the most-significant bit of the Index register. 
Memory references by the instruction immediately after a TLBP instruction, or in 
cases in which more than one TLB entry is a hit, are undefined. 
Operation: 
32 T:  Indexe1 || 02° || Undefined® 
for i in 0...TLBEntries — 1 
if (TLB[ilg5..77 = EntryHig,..43) and (TLB[i]7¢ or 
(TLB[i]71...64 = EntryHi7. 9)) then 
Index + 0°6 ||is 9 
endif 
endfor 
64. T:  Indexe1 || 02° || Undefined® 
for i in 0...TLBEntries — 1 
if (TLBLi]167..141 and not (0'° || TLBLi}16...205)) 
= (EntryHigg,, 13 and not (0'° || TLBliJa16...205)) and 
(TLB[I]140 of (TLBUi} 135,128 = EntryHiz7, 9) then 
Index + 0° ||is5 9 
endif 
endfor 
Exceptions: 
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T L B R Read Indexed TLB Entry T L B R 


31 26 2524 65 0 
COPO CO 0 TLBR 
010000 1 0000000000000000000 000001 
6 1 19 6 
Format: 
TLBR (MIPS I format) 
Description: 


The EntryHi and EntryLo registers are loaded with the contents of the TLB entry 
selected by the contents of the Index register. The G bit (which controls ASID 
matching) read from the TLB is written into both of the EntryLo0 and EntryLol 
registers. 


The operation is invalid (and the results are undefined) if the contents of the Index 
register are greater than the number of TLB entries in the processor. 


Operation: 


32 T: PageMask < TLB[Indexs, 9]127...96 
EntryHi <— TLB[Indexs, olgs..¢64 and not TLB[Indexs o]127. 96 
EntryLo1 <-TLB[Indexs, glg3.. 33l| TLB[Indexs gl7¢ 
EntryLo0 < TLB[Indexs, ]31...4|| TLB[Indexs. ol7¢ 


64 T: PageMask < TLB[Indexs. olos5s...192 
EntryHi < TLB[Indexs, 9]191..12g and not TLB[Indexs glo55...192 
EntryLo1 <-TLB[Indexs, 9]127...65 || TLB[Indexs. ol140 
EntryLoO << TLB[Indexs, ole3...1 || TLB[Indexs, ol140 


Exceptions: 


Coprocessor Unusable exception 
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TL BWI Write Indexed TLB Entry TL BWI 


31 26 2524 65 0 
COPO CO 0 TLBWI 
010000 1 0000000000000000000 000010 
6 1 19 6 
Format: 
TLBWI (MIPS I format) 
Description: 


The TLB entry selected by the Index register is loaded with the contents of the 
EntryHi and EntryLo registers. The G bit of the TLB is written with the logical 
AND of the G bits in the EntryLo0 and EntryLo1 registers. 


The operation is invalid (and the results are undefined) if the value in the Index 
register is greater than the number of TLB entries in the processor. 


Operation: 


32,64 T: TLB[Indexs 9] <— 
PageMask || (EntryHi and not PageMask) || EntryLo1 || EntryLo0 


Exceptions: 


Coprocessor Unusable exception 
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TLBWR Write Random TLB Entry TLBWR 


31 26 2524 65 0 
COPO CO 0 TLBWR 
010000 1 0000000000000000000 000110 
6 1 19 6 
Format: 
TLBWR (MIPS I format) 
Description: 


The TLB entry selected by the Random register is loaded with the contents of the 
EntryHi and EntryLo registers. The G bit of the TLB is written with the logical 
AND of the G bits in the EntryLo0 and EntryLo1 registers. 


Operation: 


32,64 T: TLB[Randoms_ o] <— 
PageMask || (EntryHi and not PageMask) || EntryLo1 || EntryLo0 


Exceptions: 


Coprocessor Unusable exception 
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Trap If Less Than 


CPU Instruction Set 


TLT 


31 26 25 21 20 16 15 5 0 
SPECIAL rs rt code TLT 
000000 110010 
6 5 5 10 6 
Format: 
TLT rs, rt (MIPS II format) 
Description: 


The contents of general-purpose register rt are compared with general-purpose 
register rs. Interpreting both values as signed integers, if the contents of general- 
purpose register rs are less than the contents of general-purpose register rt, a Trap 


exception occurs. 


The code field is available for transferring parameters to the exception handler. 
The parameter is retrieved by the exception handler only by loading as data the 


contents of the memory word containing the instruction. 


Operation: 


32, 64 


T: if GPR[rs] < GPR[rt] then 
TrapException 
endif 


Exceptions: 


Trap exception 


Vr5432 Microprocessor User’s Manual 


557 


Chapter 17 


TLTI Trap If Less Than Immediate TLTI 


31 26 25 21 20 16 15 0 
REGIMM rs TLTI immediate 
000001 01010 
6 5 5 16 
Format: 
TLTI rs, immediate (MIPS II format) 
Description: 


The 16-bit immediate is sign extended and compared with the contents of general- 
purpose register rs. Interpreting both values as signed integers, if the contents of 
general-purpose register rs are less than the sign-extended immediate, a Trap 
exception occurs. 


Operation: 
32 T: if GPR[rs] < (immediate,5)'® || immediate;s5 ¢ then 
TrapException 
endif 
64 T: if GPR[rs] < (immediate,5)*° || immediate;s5 0 then 
TrapException 
endif 
Exceptions: 


Trap exception 
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CPU Instruction Set 


TLTIU 


31 26 25 21 20 16 15 0 
REGIMM rs TLTIU immediate 
000001 01011 
6 5 5 16 
Format: 
TLTIU rs, immediate (MIPS II format) 


Description: 


The 16-bit immediate is sign extended and compared with the contents of general- 
purpose register rs. Interpreting both values as unsigned integers, if the contents 
of general-purpose register rs are less than the sign-extended immediate, a Trap 


exception occurs. 


Operation: 
32 T: if (0 || GPR[rs]) < (0 || (immediate;s)'® || immediate;s5 0) then 
TrapException 
endif 
64 T: if (0 || GPR[rs]) < (0 || (immediate,5)*° || immediate;5 0) then 
TrapException 
endif 
Exceptions: 


Trap exception 
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TLTU 


Trap If Less Than Unsigned 


TLTU 


31 26 25 21 20 16 15 0 
SPECIAL rs rt code TLTU 
000000 110011 
6 5 5 10 6 

Format: 
TLTU rs, rt (MIPS II format) 


Description: 


The contents of general-purpose register rt are compared with general-purpose 
register rs. Interpreting both values as unsigned integers, if the contents of 
general-purpose register rs are less than the contents of general-purpose register 
rt, a Trap exception occurs. 


The code field is available for transferring parameters to the exception handler. 
The parameter is retrieved by the exception handler only by loading as data the 


contents of the memory word containing the instruction. 


Operation: 


32, 64T: 


if (0 || GPR[rs]) < (0 || GPR[rt]) then 
TrapException 
endif 


Exceptions: 


Trap exception 


560 


Vr5432 Microprocessor User’s Manual 


TNE 


CPU Instruction Set 


Trap If Not Equal ™N E 


31 26 25 21 20 16 15 6 5 0 
SPECIAL rs rt code TNE 
000000 110110 
6 5 5 10 6 
Format: 
TNE rs, rt (MIPS II format) 
Description: 


The contents of general-purpose register rt are compared with those of general- 
purpose register rs. If the contents of general-purpose register rs are not equal to 
the contents of general-purpose register rt, a Trap exception occurs. 


The code field is available for transferring parameters to the exception handler. 
The parameter is retrieved by the exception handler only by loading as data the 
contents of the memory word containing the instruction. 


Operation: 


32,64T: if GPR[rs] # GPR[rt] then 


TrapException 
endif 


Exceptions: 


Trap exception 
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TN E| Trap If Not Equal Immediate ™N E | 


31 26 25 21 20 16 15 0 
REGIMM rs TNEI immediate 
000001 01110 
6 5 5 16 
Format: 
TNEI rs, immediate (MIPS II format) 
Description: 


The 16-bit immediate is sign extended and compared with the contents of general- 
purpose register rs. If the contents of general-purpose register rs are not equal to 
the sign-extended immediate, a Trap exception occurs. 


Operation: 
32 T: if GPR[rs] # (immediate,5)'® || immediate;5 9 then 
TrapException 
endif 
64 T: if GPR[rs] # (immediate,5)*° || immediate;5 9 then 
TrapException 
endif 
Exceptions: 


Trap exception 
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XOR Exclusive OR XOR 


31 26 25 21 20 16 15 11 10 6 5 0 
SPECIAL rs rt rd 0 XOR 
000000 00000 100110 
6 5 5 5 5 6 
Format: 
XOR rd, rs, rt (MIPS I format) 
Description: 


The contents of general-purpose register rs are bitwise ORed with the contents of 
general-purpose register rt. The result is stored into general-purpose register rd. 


Operation: 


32,64T: GPR{[rd] < GPR[rs] xor GPR[rt] 


Exceptions: 


None 
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XO R | Exclusive OR Immediate XO Pp i 


31 26 25 21 20 16 15 0 
XORI rs rt immediate 
001110 
6 5 5 16 
Format: 
XORI rt, rs, immediate (MIPS I format) 
Description: 


The 16-bit zero-extended immediate is bitwise ORed with the contents of general- 
purpose register rs. The result is stored in general-purpose register rt. 


Operation: 


32. T:  GPR{[rt] < GPR[rs] xor (0'° || immediate) 
64. T:  GPR{[rt] << GPR[rs] xor (0*° || immediate) 


Exceptions: 


None 
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je | CPU Instruction Opcode Bit Encoding 


Figure 17-1 and Figure 17-2 list the VR5432 opcode bit encoding. 


28...26 Opcode 
31...29 0 1 2 3 4 5 6 7 
0 SPECIAL | REGIMM J JAL BEQ BNE BLEZ BGTZ 
1 ADDI ADDIU SLTI SLTIU ANDI ORI XORI LUI 
2 COPO COP1 COP2 * BEQL BNEL BLEZL BGTZL 
3 DADDle |DADDIUe| LDLe LDRe DEBUG = = = 
4 LB LH LWL LW LBU LHU LWR LWUe 
5 SB SH SWL SW SDLe SDRe SWR__ |CACHEd 
6 LL LWC1 LWC2 * LLDe LDC1 LDC2 LDe 
7 SC SWC1 SWC2 # SCDe SDC1 SDC2 SDe 
2...0 SPECIAL function 
5...3 0 2 3 4 5 6 7 
0 SLL * SRLp SRA SLLV * SRLVp SRAV 
1 JR JALR * * SYSCALL| BREAK * SYNC 
2 MFHI MTHI MFLO MTLO DSLLVe * DSRLVep| DSRAVe 
3 MULTp | MULTUp DIV DIVU DMULTe |DMULTUe| DDIVe DDIVUe 
4 ADD ADDU SUB SUBU AND OR XOR NOR 
5 P a SLT SLTU DADDe | DADDUe| DSUBe | DSUBUe 
6 TGE TGEU TLT TLTU TEQ * TNE * 
7 DSLLe x DSRLep | DSRAe | DSLL32e * DSRL32ep DSRA32e 
18...16 REGIMM rt 
20...19 0 1 2 3 4 5 6 7 
0 BLTZ BGEZ BLTZL BGEZL * * * * 
1 TGEl TGEIU TLTI TLTIU TEQI * TNE| id 
2 BLTZAL | BGEZAL | BLTZALL | BGEZALL * * * * 
3 * * * * * * * * 
23...21 COPz rs 
25, 24 0 1 2 3 4 5 6 7 
0 MF DMFe CF g MT DMTe CT g 
{ BC g g g g g g g 
2 ere) 
3 


Figure 17-1 VR5432 Opcode Bit Encoding (1 of 2) 
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18.16 COret 
20...19 0) 1 2 3 4 5 6 7 
0 BCF BCT BCFL BCTL Y Y Y Y 
{ Y Y Y Y Y Y Y Y 
2 Y Y Y af Y x Y Y 
3 Y Y Y Y Y Y Y Y 
‘ji CPO Function 
5.3.0 1 2 3 4 5 6 7 
0 () TLBR_ | TLBWI TLBWR 
1 | TLBP 
2 5 
3 |ERET X 
4 o 
5 o 
6 > 
7 Y 
20 DEBUG Function 
5.3.0 1 2 3 4 5 6 7 
0 
1 
2 
3 
4 
5 
6 
7 |MF/TDR| DRET |DBREAK 
Figure 17-2. VR5432 Opcode Bit Encoding (2 of 2) 
Key: 
: If the operation code marked with an asterisk is executed, the 
Reserved Instruction exception occurs. These codes are reserved 
for future expansion. 
y Operation codes marked with a gamma cause a Reserved 
Instruction exception. They are reserved for future expansion 
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3) Operation codes marked with a delta are valid only with CP 
enabled and cause a Reserved Instruction exception on other 
processors. 

(0) Operation codes marked with a phi are invalid but do not caus 


Reserved Instruction exceptions 


c Operation codes marked with a xi cause a Reserved Instructio 
exception. 
x Operation codes marked with a chi are valid only on VR4000 and 


VR5000 processors. 


E Operation codes marked with an epsilon are valid in the 64-bit 
mode and 32-bit Kernel mode. In the 32-bit User or Supervisor 
mode, these codes generate the Reserved Instruction exception. 


T Operation codes marked with a pi have been used for the 
implementation-specific instruction set extensions on th 
VR5432, specifically the Multiply-Accumulate instructions and 
the Rotate instructions 
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18 


This chapter provides a detailed description of each Floating-Point Unit (FPU) 
instruction. (For a general overview of Vr5432 instructions, see Chapter 16.) 


18.1 Instruction Formats 
The instruction description subsections that follow show how the three basic 
instruction formats (I-, R-, and J-type) are used by: 
¢ Load and Store instructions 
e Transfer instructions 
e Floating-point arithmetic instructions 


e Floating-point Branch instruction 


Floating-Point instructions are mapped onto the MIPS coprocessor instructions, 
defining Coprocessor one (CP1) as the floating-point unit. 


Each operation is valid only for certain formats. Implementations may support 
some of these formats and operations through emulation, but they only need to 
support combinations that are valid (marked V in Table 18-1). Combinations 
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marked R (reserved) in Table 18-1 are not currently specified by this architecture, 


and cause an Unimplemented Instruction exception. They are reserved for future 


extensions of the architecture. 


Table 18-1 Valid FPU Instruction Forma t 


Operation 


Source Format 


Single 


Double 


Word 


Longword 


ADD 


SUB 


MUL 


DIV 


SQRT 


ABS 


AA AAA A 


AlA|A|A|A| A 


MOV 
NEG 


TRUNC.L 


ROUND.L 
CEIL.L 


FLOOR.L 


TRUNC.W 
RECIP 


ROUND.W 


RSQRT 
CEIL.W 


FLOOR.W 


<p <<] <j <<} <j <<] <i] <] <<] <] <1] <] <J| <| <|] <] <] < 


CVT.S 


<p <<] <<] <<] <<] <<] <i] < | <] <] <] <] <| <]| <| <| <] <| < 


CVT.D 


CVT.W 


<q 


CVT.L 


C 


<|<) <| < 


Vv 


R 


R 


The FPU Branch instruction can be used with the logic of the condition reversed, 


so it is only necessary to provide half of the 32 comparison predicates and 
relations required by the IEEE-754 standard. A four-bit field in the C instruction 
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(comparison) specifies one of the 16 conditions shown in the “True” column of 
Table 18-2. Inverting the sense of the condition in the Branch instruction provides 
the 16 conditions shown in the “False” column. Unordered conditions result when 
one of the operands is a NaN (i.e., a “Not a Number” encoding), which has no 
numerical order when compared to a number or another NaN. 


Table 18-2 Logical Reverse of Predicates by Condition True/False 


Condition Relations Invalid 
A ration 
ue Gaile. ee ee Equal | Unordered Esception if 
True False Than Than Unordered 

F T 0 F F F F No 

UN OR 1 F F F T No 

EQ NEQ 2 F F T F No 

UEQ OGL 3 F F T T No 

OLT UGE 4 F T F F No 

ULT OGE 5 F T F T No 

OLE UGT 6 F T T F No 

ULE OGT 7 F T T T No 

SF ST 8 F F F F Yes 

NGLE GLE 9 F F F T Yes 

SEQ SNE 10 F F T F Yes 

NGL GL 11 F F T T Yes 

LT NLT 12 F T F F Yes 

NGE GE 13 F T F T Yes 

LE NLE 14 F T T F Yes 

NGT GT 15 F T T T Yes 

F: False 

T: True 
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Floating-Point Loads, Stores, and Transfers 


All movement of data between the floating-point unit (FPU) and memory is 
accomplished by load and store operations, which reference the Floating-Point 
General-Purpose registers (FGRs). These operations are unformatted; no format 
conversions are performed and, therefore, no floating-point exceptions can be 
generated by these operations. 


Data may also be directly moved between the floating-point unit and the processor 
by Move to Coprocessor (MTC) and Move from Coprocessor (MFC) instructions. 
Like the floating-point load and store operations, these operations perform no 
format conversions and never cause floating-point exceptions. 


In addition, two Floating-Point Control registers (FCRs) are provided for FPU 
control bits, status bits, implementation level, and revision level. These registers 
can only be accessed by the CTC1 and CFC1 instructions. 


Floating-Point Operations 


The floating-point unit instruction set includes: 
e Floating-point Add instructions 
¢ Floating-point Subtract instruction 
¢ Floating-point Multiply instruction 
e Floating-point Divide instruction 
¢ Floating-point Square Root instructions 
e Floating-point Reciprocal instruction 
¢ Floating-point Reciprocal Square Root instructions 
¢ Conversion between fixed-point and floating-point format 
¢ Conversion between floating-point format 
¢ Floating-point Compare instructions 
These operations satisfy the requirements of the IEEE-754 standard for accuracy. 


Specifically, these operations obtain a result identical to an infinite-precision 
result rounded to the specified format, using the current rounding mode. 


Instructions must specify the format of their operands. Except for conversion 
functions, mixed-format operations cannot be performed. 


In the VR5432 implementation, the instruction immediately following a load may 
use the contents of the register being loaded. In such cases, the hardware 
interlocks by the number of cycles required for reading. Scheduling load delay 
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slots is not required for functional code; however, it still may be desirable for 
highest performance or compatibility with the VR3000 device (which lacks 
interlocks). 


Load and Store instruction behavior depends on FGR width. 


¢ When the FR bit in the Status register is clear, the Floating-Point 
General-Purpose registers (FGRs) are 32 bits wide. 


¢ To hold single-precision floating-point format data, sixteen even- 
numbered registers out of 32 FGRs are available. 


¢ To hold double-precision floating-point format data, the 32-bit 
registers are used in pairs as 16 64-bit registers 


¢ When the FR bit in the Status register is set, the FGRs are 64 bits 
wide. 


¢ To hold single-precision floating-point format data, the low half of 32 
FGRs are used. 


¢ To hold double-precision floating-point format data, 32 FGRs are 
used. 


In the load and store operation descriptions, the functions listed in 
Table 18-3 are used to represent the handling of virtual addresses and physical 
memory. 


Table 18-3 FPU Load/Store Instructions Using Registe r +Register Addressing 


Mnemonic Description sant 
LWXCl Load Word Indexed to FPU IV 
SWXCl Store Word Indexed from FPU IV 
LDXCl Load Doubleword Indexed to FPU IV 
SDXCl Store Doubleword Indexed from FPU IV 
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Figure 18-1 shows the I-type instruction format used by Load and Store 


instructions. 


I-type (Immediate) 


31 26 25 21 20 16 «15 0 


offset 


op: 6-bit opcod 
base: 5-bit base register specifier 
ft: 5-bit source (for stores) or destination (for loads) FPU register specifie 
offset: 16-bit signed immediate offset 


Figure 18-1 Load and Store Instruction Format 


All coprocessor loads and stores reference data that is located at word boundaries. 
For word loads and stores, the access type field is always word, and the low-order 
two bits of the address must always be zero. For doubleword loads and stores, the 
access type field is always doubleword, and the low-order three bits of the address 
must always be zero. 


Regardless of byte-numbering order, the address specifies the byte that has the 
smallest byte address in the accessed field. For a big-endian system, this is the left- 
most byte; for a little-endian system, this is the right-most byte. 
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18.2 Floating-Point Computational Instructions 


Computational instructions include all of the floating-point arithmetic operations 
performed by the FPU. 


Figure 18-2 shows the R-type instruction format used for computational 
instructions. 


R-type (Register) 


31 26 25 21 20 16 15 11.10 6 5 0 


COP1 fmt ft fs fd function 


COP1: 6-bit opcode 
fmt: 5-bit format specifie 
fs: 5-bit source 1 register 
ft: 5-bit source 2 register 
fd: 5-bit destination register 


function: 6-bit function field 


Figure 18-2 Computational Instruction Format 


The function field indicates the floating-point operation to be performed. 


Each floating-point instruction can be applied to a number of operand formats. 
The operand format for an instruction is specified by the 5-bit format field (fmt); 
decoding for this field is shown in Table 18-4. 


Table 18-4 Format Field Decoding 


Code Mnemonic Size Format 

0-15 Reserved 

16 S Single (32 bits) Binary floating-point 
17 D Double (64 bits) Binary floating-point 
18 Reserved 

19 Reserved 

20 Ww 32 bits Binary fixed-point 

21 L 64 bits Binary fixed-point 
22-31 Reserved 
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Table 18-5 lists all floating-point computational instructions. 


Table 18-5 Floating-Point Computational Instructions and Operations 


a Mnemonic Operation 
0 ADD Add 
1 SUB Subtract 
2 MUL Multiply 
3 DIV Divide 
4 SQRT Square root 
5 ABS Absolute value 
6 MOV Transfer 
7 NEG Sign reverse 
8 ROUND.L Convert to 64-bit fixed-point, rounded to nearest even number 
9 TRUNC.L Convert to 64-bit fixed-point, rounded toward zero 
10 CEIL.L Convert to 64-bit fixed-point, rounded to + 
11 FLOOR.L Convert to 64-bit fixed-point, rounded to — co 
12 ROUND.W Convert to 32-bit fixed-point, rounded to nearest even number 
13 TRUNC.W Convert to 32-bit fixed-point, rounded toward zero 
14 CEIL.W Convert to 32-bit fixed-point, rounded to + co 
15 FLOOR.W Convert to 32-bit fixed-point, rounded to — co 
16-31 —= Reserved 
32 CVT.S Convert to single floating-point 
33 CVT.D Convert to double floating-point 
34 — Reserved 
35 — Reserved 
36 CVT.W Convert to 32-bit fixed-point 
37 CVT.L Convert to 64-bit fixed-point 
38-47 — Reserved 
48-63 C Floating-Point Compare 
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The following routines are used in the description of the floating-point operations 
to retrieve the value of an FPR or to change the value of an FGR: 


32-Bit Mode 


end 


end 


value <-- ValueFPR(fpr, fmt) 
/* undefined for odd fpr */ 
case fmt of 
S, W: 
value <-- FGR[fpr+0] 
D: 
value <-- FGR[fpr+1] || FGR[fpr+0] 


StoreFPR (fpr, fmt, value): 
/* undefined for odd fpr */ 
case fmt of 
S, W: 
FGR[fpr+1] <-- undefined 
FGR[fpr+0] <-- value 


FGR[fpr+1] <-- value63...32 
FGR[fpr+0] <-- value31...0 
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64-Bit Mode 


value <-- ValueFPR(fpr, fmt) 
case fmt of 
S, W: 
value <-- FGR[fpr]31...0 
D, L: 
value <-- FGR[fpr] 
end 


StoreFPR(fpr, fmt, value): 
case fmt of 
S, W: 
FGR[fpr] <-- undefined32 || value 
D, L: 
FGR[fpr] <-- value 
end 


FPU Instructions 


This section describes in detail the FPU instructions. 


Exceptions that may occur are listed at the end of each instruction’s description. 
For details regarding FPU exceptions and exception processing, refer to Chapter 


8. 
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ABS.fmt pects Vane ABS.fm 


31 


26 25 21 20 16 15 11 10 6 5 0 


COP1 
010001 


fmt 0 fs fd ABS 
00000 000101 


6 


5 5 5 5 6 


Format: 


ABS.fmt fd, fs (MIPS I format) 


Description: 


The absolute value of the contents of floating-point register fs is taken and stored 
in floating-point register fd. The operand is processed in the floating-point format 


fint. 


The absolute value operation is arithmetically performed. If the operand is NaN, 
therefore, the Invalid Operation exception occurs. 


This instruction is valid only in the single- and double-precision floating-point 
formats. 


If the FR bit of the Status register is 0, only an even number can be specified as a 
register number because adjacent even-numbered and odd-numbered registers are 
used in pairs as floating-point registers. If an odd number is specified, the 
operation is undefined. 


If the FR bit is 1, both odd and even register numbers are valid. 


Operation: 


32,64T: | StoreFPR (fd, fmt, AbsoluteValue (ValueFPR (fs, fmt) ) ) 


Exceptions: 


Coprocessor Unusable exception 
Floating-Point exception 


Floating-Point Exceptions: 


Unimplemented Operation exception 
Invalid Operation exception 
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ADD.fmt Floating-Point Add ADD.fmt 


31 26 25 21 20 16 15 11 10 6 5 0 
COP1 fmt ft fs fd ADD 
010001 000000 
6 5 5 5 5 6 
Format: 
ADD.fmt fd, fs, ft (MIPS I format) 
Description: 


The contents of floating-point registers fs and ft are added and the result is stored 
in floating-point register fd. The operand is processed in the floating-point format 
jmt. The operation is executed as if the accuracy were infinite, and the result is 
rounded according to the current rounding mode. 


This instruction is valid only in the single- and double-precision floating-point 
formats. 


If the FR bit of the Status register is 0, only an even number can be specified as 
a register number because adjacent even-numbered and odd-numbered registers 
are used in pairs as floating-point registers. If an odd number is specified, the 
operation is undefined. If the FR bit is 1, both odd and even register numbers are 
valid. 


Operation: 


32,64T: | StoreFPR (fd, fmt, ValueFPR (fs, fmt) + ValueFPR (ft, fmt) ) 
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ADD.fmt Floating-Point Add ADD.fmt 


(continued) 


Exceptions: 


Coprocessor Unusable exception 
Floating-Point exception 


Floating-Point Exceptions: 


Unimplemented Operation exception 
Invalid Operation exception 

Inexact Operation exception 
Overflow exception 

Underflow exception 
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BC1F 


Branch on FPU False 
(Coprocessor 1) BC1 F 


31 


26 25 21 20 18171615 0 


COP1 
010001 


BC cc nd| tf offset 
01000 0/0 


6 


5 3 1 1 16 


582 


Format: 


BCIF offset (MIPS I format, cc = 0 is implied) 
BCIF cc, offset (MIPS IV format) 


Description: 


An 18-bit signed offset (the 16-bit offset field shifted left 2 bits) is added to the 
address of the instruction following the branch (not the branch itself) in the branch 
delay slot to form a PC-relative effective target address. If the floating-point 
condition code bit cc is false (0), the program branches to the effective target 
address after the instruction in the delay slot is executed. 


A floating-point condition code is set by the floating-point Compare instruction, 
C.cond.fmt. 


The MIPS I architecture defines a single floating-point condition code, 
implemented as the Coprocess orl condition signal (Cp/ Cond) and the C bit in the 
FCR31 register. MIP SI, I, and HI architectures must have the cc field set to 0, 
which is implied by the first format in the “Format” section above. Both assembler 
formats are valid for MIPS IV. 


The MIPS IV architecture adds seven more condition code bits to the original 
condition code 0. Floating-Point Compare and Conditional Branch instructions 
specify the condition code bit to set or test. The condition code bit specified by the 
cc field is modified by the ¢f (true/false) and nd (nullify delay slot) fields as 
variables. The individual instructions BC1F, BCIFL, BC1T, and BCITL have 
specific values for ¢f and nd. 


In the MIPS I, II, and III implementations, there must be at least one instruction 
between the Compare instruction that sets the condition code and the Branch 
instruction that tests it. Hardware does not detect a violation of this restriction. In 
the MIPS IV instruction set, this restriction has been removed. 
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BC1 F Branch on FPU False BC1 F 


(Coprocessor 1) 
(continued) 


Operation: 


MIPS I, Il, and III: 
T-1: condition <— FPConditionCode(0) = 0 
T:  target_offset < (offset,,)GPRLEN-(16+2) |) offset || 07 
T+1: if condition then 
PC < PC + target_offset 
endif 
MIPS IV: 
T: — condition < FPConditionCode(cc) = 0 
target_offset < (offset,<)GPRLEN (15+?) |) offset || 07 
T+1: if condition then 
PC < PC + target_offset 
endif 


Note: With the 18-bit signed instruction offset, the conditional branch 
range is +128K. Use the Jump (J) or Jump Register (JR) instruc- 
tions to branch to addresses outside this range. 


Exceptions: 


Coprocessor Unusable exception 
Reserved Instruction exception 


Floating-Point Exceptions: 


Unimplemented Operation exception 
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Branch on FPU False Likely 
BC1 FL (Coprocessor 1) BC1 FL 
31 26 25 21 20 18171615 0 
COP1 BC cc _|nd| tf offset 
010001 01000 1/0 
6 5 3 11 16 
Format: 
BCIFL offset (MIPS I format, cc = 0 is implied) 
BCIFL cc, offset (MIPS IV format) 
Description: 
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An 18-bit signed offset (the 16-bit offset field shifted left 2 bits) is added to the 
address of the instruction following the branch (not the branch itself) in the branch 
delay slot to form a PC-relative effective target address. If the floating-point 
condition code bit cc is false (0), the program branches to the effective target 
address after the instruction in the delay slot is executed. If the branch is not taken, 
the instruction in the delay slot is not executed. 


A floating-point condition code is set by the Floating-Point Compare instruction, 
C.cond.fmt. 


The MIPS I architecture defines a single floating-point condition code, 
implemented as the Coprocess orl condition signal (Cp/ Cond) and the C bit in the 
FCR31 register. MIP SI, I, and II architectures must have the cc field set to 0, 
which is implied by the first format in the “Format” section above. Both assembler 
formats are valid for MIPS IV. 


The MIPS IV architecture adds seven more condition code bits to the original 
condition code 0. Floating-Point Compare and Conditional Branch instructions 
specify the condition code bit to set or test. The condition code bit specified by the 
cc field is modified by the ¢f (true/false) and nd (nullify delay slot) fields as 
variables. The individual instructions BC1F, BCIFL, BC1T, and BCITL have 
specific values for tf and nd. 


In the MIPS I, II, and II] implementations, there must be at least one instruction 
between the Compare instruction that sets the condition code and the Branch 
instruction that tests it. Hardware does not detect a violation of this restriction. In 
the MIPS IV instruction set, this restriction has been removed. 
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BC1 FL Branch on FPU False Likely BC1 FL 


(Coprocessor 1) 
(continued) 


Operation: 


MIPS I, Il, and III: 
T-1: condition <— FPConditionCode(0) = 0 
T:  target_offset < (offset,,)GPRLEN-(16+2) |) offset || 07 
T+1: if condition then 
PC < PC + target_offset 
else 
NullifyCurrentinstruction() 
endif 
MIPS IV: 
T: — condition < FPConditionCode(cc) = 0 
target_offset < (offset, 5)GPRLEN- (15+) |) offset || 0° 
T+1: if condition then 
PC < PC + target_offset 
else 
NullifyCurrentInstruction() 


endif 


Note: Software should only use this instruction when there is a very high 
probability (98% or more) that the branch will be taken. If the 
branch is not likely to be taken or if the probability of a taken 
branch is unknown, users are encouraged to use the BCIF instruc- 
tion instead. 
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BC1 FL Branch on FPU False Likely BC1 FL 


(Coprocessor 1) 
(continued) 


Note: With the 18-bit signed instruction offset, the conditional branch 
range is +128K. Use Jump (J) or Jump Register (JR) instructions to 
branch to addresses outside this range. 


Exceptions: 


Coprocessor Unusable exception 
Reserved Instruction exception 


Floating-Point Exceptions: 


Unimplemented Operation exception 
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Branch on FPU True 
BC1 T (Coprocessor 1) BC1 T 
31 26 25 21 20 18171615 0 
COP1 BC cc nd] tf offset 
010001 01000 0} 1 
6 5 3 11 16 
Format: 
BCIT offset (MIPS I format, cc = 0 is implied) 
BCIT cc, offset (MIPS IV format) 
Description: 


An 18-bit signed offset (the 16-bit offset field shifted left 2 bits) is added to the 
address of the instruction following the branch (not the branch itself) in the branch 
delay slot to form a PC-relative effective target address. If the floating-point 
condition code bit cc is false (0), the program branches to the effective target 
address after the instruction in the delay slot is executed. 


A floating-point condition code is set by the floating-point Compare instruction, 
C.cond.fmt. 


The MIPS I architecture defines a single floating-point condition code, 
implemented as the Coprocess orl condition signal (Cp/ Cond) and the C bit in the 
FCR31 register. MIP SI, I, and HI architectures must have the cc field set to 0, 
which is implied by the first format in the “Format” section above. Both assembler 
formats are valid for MIPS IV. 


The MIPS IV architecture adds seven more condition code bits to the original 
condition code 0. Floating-Point Compare and Conditional Branch instructions 
specify the condition code bit to set or test. The condition code bit specified by the 
cc field is modified by the ¢f (true/false) and nd (nullify delay slot) fields as 
variables. The individual instructions BC1F, BCIFL, BC1T, and BCITL have 
specific values for ¢f and nd. 


In the MIPS I, II, and II implementations, there must be at least one instruction 
between the Compare instruction that sets the condition code and the Branch 
instruction that tests it. Hardware does not detect a violation of this restriction. In 
the MIPS IV instruction set, this restriction has been removed. 
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BC1 T Branch on FPU True BC1 T 


(Coprocessor 1) 
(continued) 


Operation: 


MIPS I, Il, and III: 
T-1: condition <— FPConditionCode(0) = 1 
T:  target_offset < (offset,,)GPRLEN-(16+2) |) offset || 07 
T+1: if condition then 
PC < PC + target_offset 
endif 
MIPS IV: 
T: condition < FPConditionCode(cc) = 1 
target_offset < (offset,<)GPRLEN (15+?) |) offset || 07 
T+1: if condition then 
PC < PC + target_offset 
endif 


Note: With the 18-bit signed instruction offset, the conditional branch 
range is + 128K. Use Jump (J) or Jump Register (JR) instructions 
to branch to addresses outside this range. 


Exceptions: 


Coprocessor Unusable exception 
Reserved Instruction exception 


Floating-Point Exceptions: 


Unimplemented Operation exception 
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Branch on FPU True Likely 
BC1 TL (Coprocessor 1) BC1 TL 
31 26 25 21 20 18171615 0 
COP1 BC cc _|nd| tf offset 
010001 01000 1] 1 
6 5 3 1 $1 16 
Format: 
BCITL offset (MIPS I format, cc = 0 is implied) 
BCITL cc, offset (MIPS IV format) 
Description: 


An 18-bit signed offset (the 16-bit offset field shifted left 2 bits) is added to the 
address of the instruction following the branch (not the branch itself) in the branch 
delay slot to form a PC-relative effective target address. If the floating-point 
condition code bit cc is false (0), the program branches to the effective target 
address after the instruction in the delay slot is executed. If the branch is not taken, 
the instruction in the delay slot is not executed. 


A floating-point condition code is set by the floating-point Compare instruction, 
C.cond.fmt. 


The MIPS I architecture defines a single floating-point condition code, 
implemented as the Coprocess orl condition signal (Cp/ Cond) and the C bit in the 
FCR31 register. MIP SI, I, and II architectures must have the cc field set to 0, 
which is implied by the first format in the “Format” section above. Both assembler 
formats are valid for MIPS IV. 


The MIPS IV architecture adds seven more condition code bits to the original 
condition code 0. Floating-Point Compare and Conditional Branch instructions 
specify the condition code bit to set or test. The condition code bit specified by the 
cc field is modified by the ¢f (true/false) and nd (nullify delay slot) fields as 
variables. The individual instructions BC1F, BCIFL, BC1T, and BCITL have 
specific values for tf and nd. 


In the MIPS I, II, and III implementations, there must be at least one instruction 
between the Compare instruction that sets the condition code and the Branch 
instruction that tests it. Hardware does not detect a violation of this restriction. In 
the MIPS IV instruction set, this restriction has been removed. 
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BC1 TL Branch on FPU True Likely 


(Coprocessor 1) 
(continued) 


Operation: 


MIPS I, Il, and III: 


T-1: condition <— FPConditionCode(0) = 1 
(GEREN Ute) | offset | 02 


T: — target_offset < (offset;s 
T+1: if condition then 


PC < PC + target_offset 


else 


NullifyCurrentinstruction() 


endif 
MIPS IV: 


T: — condition < FPConditionCode(cc) = 1 
GPRLEN-(16+2) | offset | 02 


target_offset < (offsetys) 
T+1: if condition then 


PC < PC + target_offset 


else 


NullifyCurrentInstruction() 


endif 


BC1ITL 


590 


Note: Software should only use this instruction when there is a very high 
probability (98% or more) that the branch will be taken. If the 
branch is not likely to be taken or if the probability of a taken 
branch is unknown, users are encouraged to use the BCIT instruc- 


tion instead. 
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BC1 TL Branch on FPU True Likely BC1 TL 


(Coprocessor 1) 
(continued) 


Note: With the 18-bit signed instruction offset, the conditional branch 
range is + 128K. Use Jump (J) or Jump Register (JR) instructions 
to branch to addresses outside this range. 


Exceptions: 


Coprocessor Unusable exception 
Reserved Instruction exception 


Floating-Point Exceptions: 


Unimplemented Operation exception 
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C.cond.fmt perenne C.cond.fmt 


26 25 21 20 16 15 1110 876543 0 


fmt ft fs cc 0 | FC cond 
00/11 


5 5 5 3 2 2 4 


Format: 


C.cond.fmt fs, ft (MIPS I format, cc = 0 is implied) 
C.cond.fmt cc, fs, ft (MIPS IV format) 


Description: 


592 


The value in floating-point register fs is compared to the value in floating-point 
register ft; the values are in format fmt. The comparison is exact and neither 
overflows nor underflows. 


If the comparison specified by cond _; is true for the operand values, the result is 
true; otherwise, the result is false. If no exception is taken, the result is written into 
condition code cc; true is 1 and false is 0. 


If cond; is set and at least one of the values is a NaN, an Invalid Operation 
condition is raised; the result depends on the Floating-Point exception model 
currently active: 


Precise exception model: The Invalid Operation flag is set in the FCR31 register. 
If the Invalid Operation Enable bit is set in the FCR31 register, no result is written 
and an Invalid Operation exception is taken immediately. Otherwise, the Boolean 
result is written into condition code cc. 


Imprecise exception model (R8000® normal mode): The Boolean result is 
written into condition code cc. No FCR31 register flag is set. If th Invalid 
Operation Enabl bit is set in the FCR31 register, an Invalid Operation 
exception is taken, imprecisely, at some future time 


There are four mutually exclusive ordering relations for comparing floating-point 
values; one relation is always true and the others are false. The familiar relations 
are greater than, less than, and equal. In addition, the IEEE floating-point 
standard defines the relation unordered, which is true when at least one operand 
value is NaN; NaN compares unordered with everything, including itself. 
Comparisons ignore the sign of zero, so +0 equals —0. 
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C.cond.fmt ae C.cond.fmt 


(continued) 


The comparison condition is a logical predicate, or equation, of the ordering 
relations such as less than or equal, equal, not less than, or unordered or equal. 
Compare distinguishes among the 16 comparison predicates. The Boolean result 
of the instruction is obtained by substituting the Boolean value of each ordering 
relation for the two floating-point values in the equation. If the equal relation is 
true, for example, then all four example predicates above yield a true result. If the 
unordered relation is true, then only the final predicate, unordered or equal, yields 
a true result. 


Logical negation of a compare result allows eight distinct comparisons to test for 
the 16 predicates, as shown in Table 18-6. Each mnemonic tests for both a 
predicate and its logical negation. For each mnemonic, compare tests the truth of 
the first predicate. When the first predicate is true, the result is true as shown in 
the “If Predicate Is True” column and the second predicate must be false, and vice 
versa. (Note that the False predicate is never true and False/True do not follow the 
normal pattern.) 


The truth of the second predicate is the logical negation of the instruction result. 

After a Compare instruction, a test for the truth of the first predicate can be made 
with the Branch on FPU True (BCIT) instruction and the truth of the second can 
be made with the Branch on FPU False (BC1F) instruction. 


Vr5432 Microprocessor User’s Manual 593 


Chapter 18 


C.cond.fmt eae C.cond.fmt 


594 


(continued) 


Table 18-7 shows another set of eight compare operations, distinguished by a 
cond; value of 1 and testing the same 16 conditions. For these additional 
comparisons, if at least one of the operands is a NaN, including a Quiet NaN 
(QNaN), then an Invalid Operation condition is raised. If the Invalid Operation 
condition is enabled in the FCSR, an Invalid Operation exception occurs. 
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(continued) 


Floating-Point Unit Instruction Set 


C.cond.fmt 


Table 18-6 FPU Comparisons Without Special Operand Exceptions 


Instruction Comparison Predicate Comparison C me nee 
Result tion 
Relation Inv. Condi- 
Condition Name of Predicate and Value P es t a one 
Mnemonic | Logically Negated Predicate (Abbreviation ee ak 
>|l<l=al? Is True if 3 |2..0 
QNaN? 
F False [this predicate is always False] |F |F |F |F 7 6 
True (T) T |T |T|T 
Unordered F /F /F IT |T 
UN 1 
Ordered (OR) T|T |T/F IF 
Equal FF |T JF /T 
EQ 2 
Not Equal (NEQ) T|T |F/T JF 
Unordered or Equal FF |T |T /T 
UEQ Ordered or Greater Than or Less Than 3 
(OGL) T|T |F JF IF 
Ordered or Less Than F/T /F |F |T 
OLT Unordered or Greater Than or Equal 4 
(UGE) TF |T/T IF 
CLT Unordered or Less Than F/T /F |T |T No 0 P 
Ordered or Greater Than or Equal (OGE)|T |F |T |F |F 
OLE Ordered or Less Than or Equal F{T |T |F /T 6 
Unordered or Greater Than (UGT) T|F /F/T IF 
ULE Unordered or Less Than or Equal F{T |T|T /T 2 
Ordered or Greater Than (OGT) T/|F /F JF IF 
Key: ? = unordered, > = greater than, < = less than, = is equal, T = True, F = False 
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(continued) 
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Table 18-7 FPU Comparisons With Special Operand Exceptions for QNaN 
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Instruction Comparison Predicate Pomparsonce inst oe 
Result tion 
Relation Inv. Condi- 
Condition Name of Predicate and values P t mak Home 
Mnemonic | Logically Negated Predicate (Abbreviation tailed ae 
>|<l=]? Is True if 3 |2..0 
QNaN? 
Signaling False [this predicate is always FIP IFIP 
SE False] F 0 
Signaling True (ST) T|T/T|T 
Not Greater Than or Less Than or Equal |F |F |F |T |T 
NGLE Greater Than or Less Than or Equal 1 
(GLE) T|T |T/F IF 
Signaling Equal F JF /T |F |T 
SEQ & g iq 2 
Signaling Not Equal (SNE) T|T)JF|T|F 
Not Greater Than or Less Than F JF /T |T |T 
NGL Yes 1 3 
Greater Than or Less Than (GL) T|T |F JF IF 
Less Than F/T /F IF |T 
LT 4 
Not Less Than (NLT) TF TIT I/F 
Not Greater Than or Equal F {T |F {T/T 
NGE 5 
Greater Than or Equal (GE) T|F /T/F IF 
ca Less Than or Equal F/T |T IF /T é 
Not Less Than or Equal (NLE) TF JF/T I/F 
Not Greater Than F/T |T |T |T 
NGT 7 
Greater Than (GT) T|F /F JF IF 
Key: ? = unordered, > = greater than, < = less than, = is equal, T = True, F = False 
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The instruction encoding is an extension made in the MIP SIV architecture. In 
previous architecture levels, the cc field for this instruction must equal 0. 


The MIPS I architecture defines a single floating-point condition code, 
implemented as the Coprocess orl condition signal (Cp/ Cond) and the C bit in the 
FCR31 register. MIP SI, I, and II architectures must have the cc field set to 0, 
which is implied by the first format in the “Format” section. Both assembler 
formats are valid for MIPS IV. 


The MIPS IV architecture adds seven more condition code bits to the original 
condition code 0. Floating-Point Compare and Conditional Branch instructions 
specify the condition code bit to set or test. 


The fields fs and ft must specify FPRs valid for operands of type fmt; if they are 
not valid, the result is undefined. 


The operands must be values in format fmt; if they are not, the result is undefined 
and the value of the operand FPRs becomes undefined. 


In the MIPS I, II, and II implementations, there must be at least one instruction 
between the Compare instruction that sets the condition code and the Branch 
instruction that tests it. Hardware does not detect a violation of this restriction. In 
the MIPS IV instruction set, this restriction has been removed. 
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Operation: 


32,64 iT: if NaN (ValueFPR (fs, fmt) ) or NaN (ValueFPR (ft, fmt) ) then 
less < false 
equal < false 
unordered < true 
if condg then 
signal InvalidOperationException 


endif 

else 
less <— ValueFPR (fs, fmt) < ValueFPR (ft, fmt) 
equal <— ValueFPR (fs, fmt) = ValueFPR (ft, fmt) 
unordered <— false 

endif 


condition < (conds and less) or (cond; and equal) or 
(condg and unordered) 
SetFPConditionCode (cc, condition) 
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(continued) 


Note: Floating-point computational instructions, including compare, that 
receive an operand value of Signaling NaN (SNan) raise the Invalid 
Operation condition. Comparisons that raise the Invalid Operation 
condition for Quiet NaNs in addition to SNaNs permit a simpler 
programming model if NaNs are errors. Using these compares, pro- 
grams do not need explicit code to check for QNaNs causing the 
unordered relation. Instead, they take an exception and allow the 
exception handling system to deal with the error when it occurs 
For example, consider a comparison in which we want to know i 
two numbers are equal, but for which unordered would be an er- 
ror. 


# comparisons using explicit tests for QNaN 


c.eq.d S£2,S£4 # check for equal 
nop 
belt L2 # it is equal 
c.un.d S£2,S£4 # it is not equal, 
# but might be unordered 
belt ERROR # unordered goes off to an error handler 


# not-equal-case code here 


# equal-case code here 


L2': 

# 

# comparison using comparisons that signal QNaN 
c.seq.d S£2,S£4 # check for equal 
nop 
belt L2 # it is equal 
nop 


# it is not unordered here 
# not-equal-case code here 


#equal-case code here 
L2: 
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(continued) 


Exceptions: 
Coprocessor Unusable exception 
Reserved Instruction exception 
Floating-Point Exceptions: 


Unimplemented Operation exception 
Invalid Operation exception 
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CEIL.L.tmt 9 fewer" CEIL.L.fmt 


Fixed-Point Format 


21 20 16 15 


0 CEIL.L 
00000 001010 
5 6 
Format: 
CEIL.L.fmt fd, fs (MIPS ITI format) 
Description: 


The contents of floating-point register fs are arithmetically converted into a 64-bit 
fixed-point format and the result is stored in floating-point register fd. The source 
operand is processed in the floating-point format fmt. 


The result of the conversion is rounded toward the + o direction, regardless of the 
current rounding mode. 


This instruction is valid only for conversion from the single- or double-precision 
floating-point format. 


If the FR bit of the Status register is 0, only an even number can be specified as a 
register number, because adjacent even-numbered and odd-numbered registers are 
used in pairs as floating-point registers. If an odd number is specified, the 
operation is undefined. If the FR bit is 1, both odd and even register numbers are 
valid. 


If the source operand is infinite or NaN, and if the rounded result is outside the 
range of 22) ty 2°? = 1, ‘the Unimplemented Operation exception occurs. If the 
Unimplemented Operation exception is not enabled, the exception does not occur, 
and 2°*— 1 is returned. 


This operation is defined in 64-bit mode and 32-bit Kernel mode. If this 
instruction is executed during 32-bit User or Supervisor mode, a Reserved 
Instruction exception occurs. 
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CEIL.L.fmt Ceiling to Long CEIL.L.fmt 
Fixed-Point Format 
(continued) 


Operation: 


32,64T: StoreFPR (fd, L, ConvertFmt (ValueFPR (fs, fmt) , fmt, L) ) 


Exceptions: 


Coprocessor Unusable exception 
Floating-Point exception 
Reserved Instruction exception 


Floating-Point Exceptions: 


Unimplemented Operation exception 
Inexact Operation exception 
Overflow exception 
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CEIL.W.fmt —catingtosinge = CEIL.W.fmt 


Fixed-Point Format 


26 25 21 20 16 15 
0 CEIL.W 
00000 001110 
5 6 
Format: 
CEIL.W.fmt fd, fs (MIPS II format) 
Description: 


The contents of floating-point register fs are arithmetically converted into a 32-bit 
fixed-point format, and the result is stored in floating-point register fd. The source 
operand is processed in the floating-point format fmt. 


The result of the conversion is rounded toward the + - direction, regardless of the 
current rounding mode. 


This instruction is valid only for conversion from the single- or double-precision 
floating-point format. 


If the FR bit of the Status register is 0, only an even number can be specified as a 
register number, because adjacent even-numbered and odd-numbered registers are 
used in pairs as floating-point registers. If an odd number is specified, the 
operation is undefined. If the FR bit is 1, both odd and even register numbers are 
valid. 


If the source operand is infinite or NaN, and if the rounded result is outside the 
range of 23! 1 to -23!, the Invalid Operation exception occurs. If the Invalid 
Operation exception is not enabled, the exception does not occur, and 971-1 is 
returned. 
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Fixed-Point Format 
(continued) 


Operation: 


32,64T: | StoreFPR (fd, W, ConvertFmt (ValueFPR (fs, fmt) , fmt, W) ) 


Exceptions: 
Coprocessor Unusable exception 
Floating-Point exception 
Floating-Point Exceptions: 


Invalid Operation exception 
Unimplemented Operation exception 
Inexact Operation exception 
Overflow exception 
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CFC1 Move Control Word from FPU CFC1 


(Coprocessor 1) 


31 26 25 21 20 16 15 11 10 0 
COP1 CF rt fs 0) 
010001 00010 00000000000 
6 5 5 5 11 
Format: 
CFCI1 rt, fs (MIPS I format) 
Description: 


The contents of floating-point control register fs are loaded into general-purpose 
register rt. 


This instruction is only defined when fs equals 0 or 31. 


The contents of general-purpose register rt are undefined while the instruction 
immediately following this Load instruction is being executed. 


Operation: 


32 T: temp < FCRi[fs] 
T+1: GPR[rt] — temp 


64 T: temp < FCRifs] 
T+1: GPR[irt] <— (temp3;)°" || temp 


Exceptions: 


Coprocessor Unusable exception 
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CTC1 Move Control Word to FPU CTC1 


(Coprocessor 1) 


31 26 25 21 20 16 15 11 10 0 
COP1 CT rt fs 0 
010001 00110 00000000000 
6 5 5 5 11 
Format: 
CTCI rt, fs (MIPS I format) 
Description: 


The contents of general-purpose register rt are stored in floating-point control 
register fs. This instruction is defined only if fs is 0 or 31. 


If the cause bit of the floating-point Control/Status register (FCR31) and the 
corresponding enable bit are set by writing data to FCR31, the Floating-Point 
exception occurs. Write the data to the register before the exception occurs. 


The contents of floating-point control register fs are undefined while the 
instruction immediately following this instruction is executed. 


Operation: 


32 T: temp < GPR[rt] 
T+1:  FCR[fs] < temp 
COC[1] — FCR[31]o3 
64 T: temp < GPR[rt]s1_.0 


T+1: FCRI[fs] < temp 
COC[1] — FCR[31]o3 
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CTC1 Move Control Word to FPU CTC1 


(Coprocessor 1) 
(continued) 


Exceptions: 


Coprocessor Unusable exception 
Floating-Point exception 


Floating-Point Exceptions: 


Invalid Operation exception 
Unimplemented Operation exception 
Division by Zero exception 

Inexact Operation exception 
Overflow exception 

Underflow exception 
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CVT.D.fmt pane CVT.D.fmt 


Floating-Point Format 


31 


26 25 21 20 16 15 1110 6 5 0 


COP1 
010001 


fmt 0 fs fd CVT.D 
00000 100001 


6 


5 5 5 5 6 


Format: 


CVT.D:S fd, fs (MIPS I format, fmt = S) 
CVT.D.W fd, fs (MIPS I format, fmt = W) 
CVT.D.L fd, fs (MIPS III format, fmt = L) 


Description: 


The contents of floating-point register fs are arithmetically converted to a double- 
precision floating-point format; the result is stored in floating-point register fd. 
The source operand is processed in the floating-point format fmt. 


This instruction is valid only for conversion from the single-precision floating- 
point format and the 32-bit or 64-bit fixed-point formats. 


In the single-precision floating-point format or 32-bit fixed-point format, this 
conversion operation is executed correctly without losing any accuracy. 


If the FR bit of the Status register is 0, only an even number can be specified as a 
register number, because adjacent even-numbered and odd-numbered registers are 
used in pairs as floating-point registers. If an odd number is specified, the 
operation is undefined. If the FR bit is 1, both odd and even register numbers are 
valid. 


Operation: 


32,64T: StoreFPR (fd, D, ConvertFmt (ValueFPR (fs, fmt), fmt, D) ) 
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CVT.D.fmt pata CVT.D.fmt 


Floating-Point Format 
(continued) 


Exceptions: 
Coprocessor Unusable exception 
Floating-Point exception 
Floating-Point Exceptions: 


Invalid Operation exception 
Unimplemented Operation exception 
Inexact Operation exception 
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fmt paieenah ay CVT.L.fmt 


Fixed-Point Format 


31 26 25 21 20 16 15 11.10 6 5 0 
COP1 fmt 0 fs fd CVT.L 
010001 00000 100101 
6 5 5 5 5 6 
Format: 
CVT.L.fmt fd, fs (MIPS III format) 
Description: 

The contents of floating-point register fs are arithmetically converted into a 64-bit 
fixed-point format; the result is stored in floating-point register fd. The source 
operand is processed in the floating-point format fmt. 
This instruction is valid only for conversion from the single- or double-precision 
floating-point format. 
If the FR bit of the Status register is 0, only an even number can be specified as a 
register number, because adjacent even-numbered and odd-numbered registers are 
used in pairs as floating-point registers. If an odd number is specified, the 
operation is undefined. If the FR bit is 1, both odd and even register numbers are 
valid. 
If the source operand is infinite or NaN, and if the rounded result is outside the 
range of 2972 49:9"? — the Unimplemented Operation exception occurs. If the 
Unimplemented Operation exception is not enabled, the exception does not occur, 
and 2>* — 1 is returned. 
This operation is defined in 64-bit mode and 32-bit Kernel mode. If this 
instruction is executed during 32-bit User or Supervisor mode, a Reserved 
Instruction exception occurs. 
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CVT.L.fmt ea CVT.L.fmt 


Fixed-Point Format 
(continued) 


Operation: 


64 TT: StoreFPR (fd, L, ConvertFmt (ValueFPR (fs, fmt), fmt, L) ) 


Note: Same operation in 32-bit Kernel mode. 


Exceptions: 


Coprocessor Unusable exception 
Floating-Point exception 
Reserved Instruction exception 


Floating-Point Exceptions: 


Unimplemented Operation exception 
Inexact Operation exception 
Overflow exception 
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fmt Senha CVT.S.fmt 


Floating-Point Format 


31 26 25 21 20 16 15 1110 6 5 0 
COP1 fmt 0 fs fd CVT.S 
010001 00000 100000 
6 5 5 5 5 6 

Format: 
CVT.S.D fd, fs (MIPS I format, fmt = D) 
CVT.S.W fd, fs (MIPS I format, fmt = W) 
CVT.S.L fd, fs (MIPS III format, fmt = L) 

Description: 
The contents of floating-point register fs are arithmetically converted into a single- 
precision floating-point format; the result is stored in floating-point register fd. 
The source operand is processed in the floating-point format fmt. The result of the 
conversion is rounded according to the current rounding mode. 
This instruction is valid only for conversion from the double-precision floating- 
point format, and 32-bit or 64-bit fixed-point format. 
If the FR bit of the Status register is 0, only an even number can be specified as a 
register number, because adjacent even-numbered and odd-numbered registers are 
used in pairs as floating-point registers. If an odd number is specified, the 
operation is undefined. If the FR bit is 1, both odd and even register numbers are 
valid. 

Operation: 

32,64T: | StoreFPR (fd, S, ConvertFmt (ValueFPR (fs, fmt), fmt, S) ) 
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Floating-Point 
CVT.S.fmt Convert to Single CVT.S.fmt 
Floating-Point Format 
(continued) 


Exceptions: 


Coprocessor Unusable exception 
Floating-Point exception 


Floating-Point Exceptions: 


Invalid Operation exception 
Unimplemented Operation exception 
Inexact Operation exception 
Overflow exception 

Underflow exception 
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CVT.W.fmt pests CVT.W.fmt 


Fixed-Point Format 


31 


26 25 21 20 16 15 11.10 6 5 0 


COP1 
010001 


fmt 0 fs fd CVT.W 
00000 100100 


6 


5 5 5 5 6 


614 


Format: 


CVT.W fmt fd, fs (MIPS I format) 


Description: 


The contents of floating-point register fs are arithmetically converted to a 32-bit 
fixed-point format and the result is stored in floating-point register fd. The source 
operand is processed in the floating-point format fmt. 


This instruction is valid only for conversion from the single- or double-precision 
floating-point format. 


If the FR bit of the Status register is 0, only an even number can be specified as a 
register number because adjacent even-numbered and odd-numbered registers are 
used in pairs as floating-point registers. If an odd number is specified, the 
operation is undefined. If the FR bit is 1, both odd and even register numbers are 
valid. 


If the source operand is infinite or NaN and if the rounded result is outside the 
range of 23! 1 to -23!, the Invalid Operation exception occurs. If the Invalid 
Operation exception is not enabled, the exception does not occur and Oe Vis 
returned. 
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CVT.W.fmt aa CVT.W.fmt 


Fixed-Point Format 
(continued) 


Operation: 


32,64T: | StoreFPR (fd, W, ConvertFmt (ValueFPR (fs, fmt), fmt, W) ) 


Exceptions: 
Coprocessor Unusable exception 
Floating-Point exception 
Floating-Point Exceptions: 


Invalid Operation exception 
Unimplemented Operation exception 
Inexact Operation exception 
Overflow exception 
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DIV.fmt 


Floating-Point Divide 


DIV.fmt 


31 26 25 21 20 16 15 


11 10 


COP1 
010001 


fmt 


DIV 
000011 


6 


6 


Format: 
DIV.fmt fd, fs, ft 


Description: 


(MIPS I format) 


The contents of floating-point register fs are divided by those of floating-point 


register ft, and the result is stored in floating-point register rd. The operand is 


processed in the floating-point format fmt. The operation is executed as if the 


accuracy were infinite, and the result is rounded according to the current rounding 


mode. 


This instruction is valid only for conversion from the single- or double-precision 


floating-point format. 


If the FR bit of the Status register is 0, only an even number can be specified as a 


register number because adjacent even-numbered and odd-numbered registers are 


used in pairs as floating-point registers. If an odd number is specified, the 
operation is undefined. If the FR bit is 1, both odd and even register numbers are 


valid. 


Operation: 


32,64 =T: 


StoreFPR (fd, fmt, ValueFPR (fs, fmt)/ValueFPR (ft, fmt) ) 


Exceptions: 
Coprocessor Unusable exception 
Floating-Point exception 
Floating-Point Exceptions: 


Unimplemented Operation exception 
Division by Zero exception 
Overflow exception 
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Invalid Operation exception 
Inexact Operation exception 
Underflow exception 
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DM FC1 Doubleword Move from FPU DM FC1 


(Coprocessor 1) 


31 26 25 21 20 16 15 1110 0 
COP1 DMF rt fs 0 
010001 00001 000 00000000 
6 5 5 5 11 
Format: 
DMFCI rt, fs (MIPS III format) 
Description: 


The contents of FPU general-purpose register fs are stored in CPU general- 
purpose register rt. 


The contents of general-purpose register rt are undefined while the instruction 
immediately following this instruction is being executed. 


The FR bit of the Status register indicates whether all 32 registers of the FPU can 
be specified. If the FR bit is 0 and the least-significant bit of fs is 1, this instruction 
is undefined. 


The operation is undefined if an odd number is specified when the FR bit of the 
Status register is 0. If the FR bit is 1, both odd-numbered and even-numbered 
registers are valid. 


This operation is defined in 64-bit mode or 32-bit Kernel mode. 
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DMFC1 


Doubleword Move from FPU 
(Coprocessor 1) 
(continued) 


DMFC1 


Operation: 
64 T: if SRog = 1 then 
data < FGR [fs] 
else 
if fSg = 0 then 
data < FGR [fs + 1] || FGR [fs] 
else 
data <— undefined®* 
endif 
T+1: GPR[rt] < data 


Exceptions: 


Same operation in 32-bit Kernel mode. 


Coprocessor Unusable exception 
Floating-Point exception 
Reserved Instruction exception 


Floating-Point Exceptions: 


Unimplemented Operation exception 
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Doubleword Move to FPU 
(Coprocessor 1) D MTC1 


31 26 25 21 20 16 15 1110 0 
COP1 DMT rt fs 0 
010001 00101 000 00000000 
6 5 5 5 11 
Format: 
DMTCIL rt, fs (MIPS III format) 
Description: 


The contents of CPU general-purpose register rt are stored in FPU general- 
purpose register fs. 


The contents of fs are undefined while the instruction immediately following this 
instruction is being executed. 


The FR bit of the Status register indicates whether all the 32 registers of the FPU 
can be specified. If the FR bit is 0 and the least-significant bit of fs is 1, this 
instruction is undefined. 


The operation is undefined if an odd number is specified when the FR bit of the 
Status register is 0. If the F'R bit is 1, both odd-numbered and even-numbered 
registers are valid. 


This operation is defined in 64-bit mode or 32-bit Kernel mode. 
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Doubleword Move to FPU 
DMTC1 (Coprocessor 1) DMTC1 


(continued) 


Operation: 


64 T: data — GPR[ri] 


T+1: if SRog =1 then 
FGR [fs] < data 
else 


if fsg = 0 then 
FGR [fs+1] <— datag3, 30 
FGR [fs] <— data31.0 
else 
undefined_ result 
endif 


Note: Same operation in 32-bit Kernel mode. 


Exceptions: 


Coprocessor Unusable exception 
Floating-Point exception 
Reserved Instruction exception 


Floating-Point Exceptions: 


Unimplemented Operation exception 
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FLOOR.L.fmt  fccrtotny | .-FLOOR.L.fmt 


Fixed-Point Format 


26 25 21 20 16 15 
0 FLOOR.L 
00000 001011 
5 6 
Format: 
FLOOR.L.fmt fd, fs (MIPS III format) 
Description: 


The contents of floating-point register fs are arithmetically converted into a 64-bit 
fixed-point format and the result is stored in floating-point register fd. The source 
operand is processed in the floating-point format fmt. 


The result of the conversion is rounded toward the — direction, regardless of the 
current rounding mode. 


This instruction is valid only for conversion from the single- or double-precision 
floating-point format. 


If the FR bit of the Status register is 0, only an even number can be specified as a 
register number because adjacent even-numbered and odd-numbered registers are 
used in pairs as floating-point registers. If an odd number is specified, the 
operation is undefined. If the FR bit is 1, both odd and even register numbers are 
valid. 


If the source operand is infinite or NaN and if the rounded result is outside the 
range of 29 ty 2°? = 1; ‘the Unimplemented Operation exception occurs. If the 
Unimplemented Operation exception is not enabled, the exception does not occur 
and 2°? — 1 is returned. 


This operation is defined in the 64-bit mode and 32-bit Kernel mode. If this 
instruction is executed during 32-bit User/Supervisor mode, a Reserved 
Instruction exception occurs. 
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FLOOR.L.fmt Floating-Point FLOOR.L.fmt 


Floor to Long 
Fixed-Point Format 
(continued) 


Operation: 


64 TT:  StoreFPR (fd, L, ConvertFmt (ValueFPR (fs, fmt) , fmt, L) ) 


Note: Same operation in 32-bit Kernel mode. 


Exceptions: 


Coprocessor Unusable exception 
Floating-Point exception 
Reserved Instruction exception 


Floating-Point Exceptions: 


Unimplemented Operation exception 
Inexact Operation exception 
Overflow exception 
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FLOOR.W.fmt sere cnce FLOOR.W.fmt 


Floor to Single 
Fixed-Point Format 


31 26 25 21 20 16 15 11 10 6 5 0 
COP1 fmt 0 fs fd FLOOR.W 
010001 00000 001111 
6 5 5 5 5 6 
Format: 
FLOOR.W.fmt fd, fs (MIPS II format) 
Description: 


The contents of floating-point register fs are arithmetically converted into a 32-bit 
fixed-point format; the result is stored in floating-point register fd. The source 
operand is processed in the floating-point format fmt. 


The result of the conversion is rounded toward the — ~ direction, regardless of the 
current rounding mode. 


This instruction is valid only for conversion from the single- or double-precision 
floating-point format. 


If the FR bit of the Status register is 0, only an even number can be specified as a 
register number, because adjacent even-numbered and odd-numbered registers are 
used in pairs as floating-point registers. If an odd number is specified, the 
operation is undefined. If the FR bit is 1, both odd and even register numbers are 
valid. 


If the source operand is infinite or NaN and if the rounded result is outside the 
range of 23! 1 to -23!, the Invalid Operation exception occurs. If the Invalid 
Operation exception is not enabled, the exception does not occur and Or eais 
returned. 
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FLOOR.W.fmt fcc esrce FLOOR.W.fmt 


Floor to Single 


Fixed-Point Format 
(continued) 


Operation: 


32,64T: | StoreFPR (fd, W, ConvertFmt (ValueFPR (fs, fmt) , fmt, W) ) 


Exceptions: 
Coprocessor Unusable exception 
Floating-Point exception 
Floating-Point Exceptions: 


Invalid Operation exception 
Unimplemented Operation exception 
Inexact Operation exception 
Overflow exception 
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LDC1 


Floating-Point Unit Instruction Set 


Load Doubleword to FPU 


(Coprocessor 1) L D C 1 


31 26 25 21 20 16 15 0 
LDC1 base ft offset 
110101 
6 5 5 16 
Format: 
LDC1 ft, offset (base) (MIPS II format) 
Description: 


The 16-bit offset is sign extended and added to the contents of general-purpose 
register base to form a virtual address. 


If the FR bit of the Status register is 0, the contents of the doubleword at the 
memory location specified by the virtual address are stored in floating-point 
registers ft and ft + /. At this time, the high-order 32 bits of the doubleword are 
stored in an odd-numbered register specified by ft + / and the low-order 32 bits 
are stored in an even-numbered register specified by ft. The operation is undefined 
if the least-significant bit in the ft field is not 0. 


Ifthe FR bit is 1, the contents of the doubleword at the memory location specified 
by the virtual address are stored in floating-point register ft. 


If any of the low-order three bits of the address is not zero, an Address Error 
exception occurs. 
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Load Doubleword to FPU 
L D C 1 (Coprocessor 1) L D C 1 


(continued) 


Operation: 


32 T:  vAddr < ( (offsets) '® || offsetys 0) + GPR[base] 
(pAddr, uncached) < AddressTranslation (vAddr, DATA) 
data < LoadMemory (uncached, DOUBLEWORD, pAddr, vAddr, DATA) 
if SRo¢ = 1 then 
FGR [ft] — data 
elseif ftg = 0 then 
FGR [ft+1] - datag3..32 
FGR [ft] — dataz;_ 0 
else 
undefined_result 
endif 


64 T:  vAddr < ( (offsets) || offsetys 0) + GPR[base] 
(pAddr, uncached) < Address Translation (vAddr, DATA) 
data < LoadMemory (uncached, DOUBLEWORD, pAddr, vAddr, DATA) 
if SRo6 = 1 then 
FGR [ft] — data 
elseif ftg = 0 then 
FGR [ft+1] - datag3..32 
FGR [ft] — dataz;_ 0 
else 
undefined_result 
endif 


Exceptions: 


Coprocessor Unusable Exception 
TLB Miss exception 

TLB Invalid exception 

Bus Error exception 

Address Error exception 
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Floating-Point Unit Instruction Set 


LDXC1 Load Doubleword Indexed to FPU LDXC1 


(Coprocessor 1) 


31 26 25 21 20 1615 1110 65 0 
COP1X base index 0 fd LDXC1 
010011 00000 000001 
6 5 5 5 5 6 
Format: 
LDXC1 fd, index (base) (MIPS IV format) 
Description: 


The contents of the 64-bit doubleword at the memory location specified by the 
aligned effective address are fetched and placed in floating-point regi s t fd. The 
contents of general-purpose registers index and base are added to form the 
effective address. 


The Region bits of the effective address must be supplied by the contents of base. 
If EffectiveAddress¢3 ¢ # basé¢3 6, the result is undefined. 


An Address Error exception occurs if EffectiveAddress, 9 #0 (not doubleword 
aligned), and the result of the instruction is undefined. 
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LDXC1 Load Doubleword Indexed to FP LDXC1 


(Coprocessor 1) 
(continued) 


Operation: 


vAddr «+ GPR[base] + GPR[index] 
if vAddro 9 # 0° then SignalException(AddressError) endif 
(pAddr, CCA) < AddressTranslation (vAddr, DATA, LOAD) 
mem < LoadMemory(CCA, DOUBLEWORD, pAddr, vAddr, DATA) 
if FP32RegistersMode then 
FPR[fd] < data 


else 
if fdg = 0 then 
FPR[fd, ; || 0] < data 
else 
FPR[fd,_; || 0] < undefined®* 
FPR[fd,_; || 1] < undefined®* 
endif 
endif 


Exceptions: 


Coprocessor Unusable exception 
Reserved Instruction exception 
Address Error exception 

TLB Refill exception 

TLB Invalid exception 
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Floating-Point Unit Instruction Set 


Load Word to FPU 
LWC1 (Coprocessor 1) LWC1 


31 26 25 21 20 16 15 0 
LWC1 base ft offset 
110001 
6 5 5 16 
Format: 
LWC1 ft, offset (base) (MIPS I format) 
Description: 


The 16-bit offset is sign extended and added to the contents of general-purpose 
register base to form a virtual address. The contents of the word at the memory 
location specified by the virtual address are loaded to floating-point register ff. 


If the FR bit of the Status register is 0 and if the least-significant bit in the ft field 
is 0, the contents of the word are stored in the low-order 32 bits of floating-point 
register ft. If the least-significant bit in the ft field is 1, the contents of the word are 
stored in the high-order 32 bits of floating-point register ft — /. 


If the FR bit is 1, all the 64-bit floating-point registers can be accessed; therefore, 
the contents of the word are stored in floating-point register ft. The value of the 
high-order 32 bits is undefined. 


If either of the low-order two bits of the address is not zero, an Address Error 
exception occurs. 


Vr5432 Microprocessor User’s Manual 629 


Chapter 18 


LWC1 


Load Word to FPU LWC1 


(Coprocessor 1) 
(continued) 


Operation: 
32 T: vAddr < ( (offset;s)'® || offsety5 o) + GPR[base] 
(pAddr, uncached) « AddressTranslation (vAddr, DATA) 
data < LoadMemory (uncached, WORD, pAddr, vAddr, DATA) 
if SRo6 = 1 then 
FGR [ft] — undefined*? || data 
else 
FGR [ft] < data 
endif 
64 T: vAddr < ( (offset;s)*° || offsety5 ) + GPR[base] 
(pAddr, uncached) « AddressTranslation (vAddr, DATA) 
data < LoadMemory (uncached, WORD, pAddr, vAddr, DATA) 
if SRo6 = 1 then 
FGR [ft] — undefined? || data 
else 
FGR [ft] < data 
endif 
Exceptions: 
Coprocessor Unusable exception 
TLB Miss exception 
TLB Invalid exception 
Bus Error exception 
Address Error exception 
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Floating-Point Unit Instruction Set 


LWXC1 Load Word Indexed to FPU LWXC1 


(Coprocessor 1) 


31 26 25 21 20 1615 1110 65 0 
COP1X base index 0 fd LWXC1 
010011 00000 000000 
6 5 5 5 5 6 
Format: 
LWXC1 fd, index (base) (MIPS IV format) 
Description: 


The contents of the 32-bit word at the memory location specified by the aligned 
effective address are fetched and placed in the low word of floating-point 
register fd. The contents of general-purpose registers index and base are added to 
form the effective address. 


The Region bits of the effective address must be supplied by the contents of base. 
If EffectiveAddress¢3 ¢ # basé¢3 6, the result is undefined. 


An Address Error exception occurs if EffectiveAddress, 9 #0 (not word aligned), 
and the result of the instruction is undefined. 
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LWXC1 Load Word Indexed to FPU LWxXC1 


(Coprocessor 1) 
(continued) 


Operation: 


vAddr «+ GPRI[base] + GPR[index] 
if vAddry 9 # 07 then SignalException(AddressError) endif 
(pAddr, CCA) < AddressTranslation (vAddr, DATA, LOAD) 
pAddr <— pAddrpgize-1..3 || (PAddre 9 xor (ReverseEndian || 0°)) 
/* mem is aligned 64-bits from memory. Pick out correct bytes. */ 
mem < LoadMemory(CCA, WORD, pAddr, vAddr, DATA) 
bytesel < vAddr» 9 xor (BigEndianCPU || 0°) 
if FP32RegistersMode then 

FPR[fd] < undefined® || data 


else 
if fdg = 0 then 
FPR[fd4_4 || 0] <— FPR[fd4. + || Oles..s2 || data 
else 
FPR[fd4_ 4 || 0] < data || FPR[fd4. 4 || Ols1_o 
endif 


endif 


Exceptions: 


Coprocessor Unusable exception 
Reserved Instruction exception 
Address Error exception 

TLB Refill exception 

TLB Invalid exception 
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MADD.fmt 


Floating-Point 


Floating-Point Unit Instruction Set 


MADD.fmt 


Multiply-Add 
31 26 25 21 20 1615 1110 65 32 0 
COP1X fr ft fs fd MADD | _ fmt 
010011 100 
6 5 5 5 5 3 3 


Format: 
MADD.fmt fd, fr, fs, ft 


Description: 


(MIPS IV format) 


The value in floating-point register fs is multiplied by the value in floating-point 
register ft to produce a product. The value in floating-point register fr is added to 
the product. The resulting sum is calculated to infinite precision, rounded 
according to the current rounding mode in the FCR31 register, and placed into 
floating-point register fd. The operands and result are values in format fmt. 


Cause bits are ORed into the Flag bits if no exception is taken. 


The fields fr, fs, ft, and fd must specify floating-point registers valid for operands 


of type fmt; if they are not valid, the result is undefined. 


The operands must be values in format fmt; if they are not, the result is undefined 
and the value of the operand floating-point registers becomes undefined. 


Operation: 


vir — ValueFPR(fr, fmt) 

vis <— ValueFPR(fs, fmt) 

vit — ValueFPR(ft, fmt) 

StoreFPR(fd, fmt, vfr tint (VES Xfmt vit) 
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Chapter 18 


Floating-Point 
MADD.fmt sicnieiy Ai 


(continued) 


Exceptions: 


Coprocessor Unusable exception 
Reserved Instruction exception 


Floating-Point Exceptions: 


Unimplemented Operation exception 
Invalid Operation exception 
Overflow exception 

Underflow exception 

Inexact Operation exception 
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Move Word from FPU 
M FC1 (Coprocessor 1) M FC 1 


31 26 25 21 20 16 15 11 10 0 
COP1 MF rt fs 0 
010001 00000 00000000000 
6 5 5 5 11 

Format: 
MEFCI rt, fs (MIPS I format) 

Description: 
The contents of floating-point general-purpose register fs are stored in general- 
purpose register rt of the CPU register rt. 
The contents of general-purpose register rt are undefined while the instruction 
immediately following this instruction is being executed. 
If the FR bit of the Status register is 0 and if the least-significant bit in the ft field 
is 0, the low-order 32 bits of floating-point register ft are stored in CPU general- 
purpose register rt. If the least-significant bit in the ft area is 1, the high-order 32 
bits of floating-point register ft — / are stored in general-purpose register rt. 
If the FR bit is 1, all 64-bit floating-point registers can be accessed; therefore, the 
high-order 32 bits of floating-point register ft are stored in CPU general-purpose 
register rt. 

Operation: 

32 T: data < FGR [fs]31..0 
T+1: GPR [rt] < data 

64 T: data < FGR [fs]31._.0 
T+1: GPR[rt] < (datas;)° || data 

Exceptions: 


Coprocessor Unusable exception 
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MOV.fmt Floating-Point Move MOV.fmt 


31 26 25 21 20 16 15 1110 6 5 0 
COP1 fmt 0 fs fd MOV 
010001 00000 000110 
6 5 5 5 5 6 

Format: 
MOV.fmt fd, fs (MIPS I format) 

Description: 
The contents of floating-point register fs are stored in floating-point register fd. 
The operand is processed in the floating-point format fmt. 
This instruction is not executed arithmetically, and no IEEE-754 exception is 
generated. 
This instruction is valid only in the single- and double-precision floating-point 
formats. 
If the FR bit of the Status register is 0, only an even number can be specified as a 
register number, because adjacent even-numbered and odd-numbered registers are 
used in pairs as floating-point registers. If an odd number is specified, the 
operation is undefined. If the FR bit is 1, both odd and even register numbers are 
valid. 

Operation: 


32, 64 


T: StoreFPR (fd, fmt, ValueFPR (fs, fmt) ) 


Exceptions: 


Coprocessor Unusable exception 
Floating-Point exception 


Floating-Point Exceptions: 
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Unimplemented Operation exception 
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MOVE Move Conditional on FPU False MOVE 


31 26 25 21 20 18171615 1110 65 0 
SPECIAL rs cc 0 | tf rd 0 MOVCI 
000000 0/0 00000 | 000001 

6 5 3 1 1 5 5 6 
Format: 
MOVF rd, rs, cc (MIPS IV format) 
Description: 


If the floating-point condition code specified by cc is zero, then the contents of 
general-purpose register rs are placed into general-purpose register rd. 


Operation: 


if FPConditionCode(cc) = 0 then 
GPR[rd] < GPR[rs] 
endif 


Exceptions: 


Coprocessor Unusable exception 
Reserved Instruction exception 
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MOVF.fmt —condttionat on FPU False MOVF.fmt 


31 26 25 21 20 18171615 1110 65 0 
COP1 fmt cc 0 | tf fs fd MOVCF 
010001 0/0 010001 
6 5 3 1 1 5 5 6 
Format: 
MOVF.fmt fd, fs, cc (MIPS IV format) 
Description: 


If the floating-point condition code specified by cc is zero, then the value in 
floating-point register fs is placed into floating-point register fd. The source and 
destination are values in format fmt. 


If the condition code is not zero, then floating-point register fs is not copied and 
floating-point register fd retains its previous value in format fmt. If fd did not 
contain a value either in format fmt or previously unused data from a load or 
move-to operation that could be interpreted in format fmt, then the value of fd 
becomes undefined. The fields fs and fd must specify floating-point registers valid 
for operands of type fmt; if they are not valid, the result is undefined. 


The move is nonarithmetic; it causes no IEEE-754 exceptions. 
Operation: 


if FRConditionCode(cc) = 0 then 
StoreFPR(fd, fmt, ValueFPR(fs, fmt)) 
else 
StoreFPR(fd, fmt, ValueFPR(fd, fmt)) 
endif 


Exceptions: 


Coprocessor Unusable exception 
Reserved Instruction exception 


Floating-Point Exceptions: 


Unimplemented Operation exception 
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MOVN.fmt 


Floating-Point Move 
Conditional on Not Zero 


Floating-Point Unit Instruction Set 


MOVN.fmt 


31 


26 25 


21 20 


1615 


11 10 


65 


COP1 
010001 


fmt 


6 


Format: 
MOVN.fmt fd, fs, rt 


Description: 


(MIPS IV format) 


If the value in general-purpose register rt is not equal to zero, then the value in 
floating-point register fs is placed in floating-point register fd. The source and 
destination are values in format fmt. 


If general-purpose register rf contains zero, then floating-point register fs is not 
copied and floating-point register fd contains its previous value in format fmt. If 
fd did not contain a value either in format fmt or previously unused data from a 
load or move-to operation that could be interpreted in format fmt, then the value 
of fd becomes undefined. The fields fs and fd must specify floating-point registers 
valid for operands of type fmt; if they are not valid, the result is undefined. 


The move is nonarithmetic; it causes no IEEE-754 exceptions. 


Operation: 


if GPR[rt] 4 0 then 


else 


endif 


StoreFPR(fd, fmt, ValueFPR(fs, fmt) 


StoreFPR(fd, fmt, ValueFPR(fd, fmt)) 


Exceptions: 


Coprocessor Unusable exception 


Reserved Instruction exception 


Floating-Point Exceptions: 


Unimplemented Operation exception 
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MOVT Move Conditional on FPU True MOVT 


31 26 25 21 20 18171615 11 10 65 0 
SPECIAL rs cc 0 | tf rd 0 MOVT 
000000 0) 1 00000 | 000001 

6 5 3 1 1 5 5 6 
Format: 
MOVT rd, rs, cc (MIPS IV format) 
Description: 


If the floating-point condition code specified by cc is one, then the contents of 
general-purpose register rs are placed into general-purpose register rd. 


Operation: 


if FPConditionCode(cc) = 1 then 
GPR[rd] < GPR[rs] 
endif 


Exceptions: 


Coprocessor Unusable exception 
Reserved Instruction exception 
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MOVT.fmt —concitionat on FPU True MOVT.fmt 


31 26 25 21 20 18171615 1110 65 0 
COP1 fmt cc 0 | tf fs fd MOVT 
010001 0|1 010001 
6 5 3 1 1 5 5 6 
Format: 
MOVT.fmt fd, fs, cc (MIPS IV format) 
Description: 


If the floating-point condition code specified by cc is one, then the value in 
floating-point register fs is placed into floating-point register fd. The source and 
destination are values in format fmt. 


If the condition code is not one, then floating-point register fs is not copied and 
floating-point register fd retains its previous value in format fmt. If fd did not 
contain a value either in format fmt or previously unused data from a load or 
move-to operation that could be interpreted in format fmt, then the value of fd 
becomes undefined. The fields fs and fd must specify floating-point registers valid 
for operands of type fmt; if they are not valid, the result is undefined. 


The move is nonarithmetic; it causes no IEEE-754 exceptions. 
Operation: 


if FRConditionCode(cc) = 1 then 
StoreFPR(fd, fmt, ValueFPR(fs, fmt)) 
else 
StoreFPR(fd, fmt, ValueFPR(fd, fmt)) 
endif 


Exceptions: 


Coprocessor Unusable exception 
Reserved Instruction exception 


Floating-Point Exceptions: 


Unimplemented Operation exception 


Vr5432 Microprocessor User’s Manual 641 


Chapter 18 


MOVZ.fmt in i petit MOVZ.fmt 


31 26 25 21 20 1615 11 10 65 0 
COP1 fmt rt fs fd MOVZ 
010001 010010 
6 5 5 5 5 6 

Format: 
MOVZ.fmt fd, fs, rt (MIPS IV format) 
Description: 


If the value in general-purpose register rt is equal to zero, then the value in 
floating-point register fs is placed in floating-point register fd. The source and 
destination are values in format fmt. 


If general-purpose register rt does not contain zero, then floating-point register fs 
is not copied and floating-point register fd contains its previous value in format 
jmt. If fd did not contain a value either in format fmt or previously unused data 
from a load or move-to operation that could be interpreted in format fmt, then the 
value of fd becomes undefined. The fields fs and fd must specify floating-point 
registers valid for operands of type fmt; if they are not valid, the result is 
undefined. 


The move is nonarithmetic; it causes no IEEE-754 exceptions. 


Operation: 


if GPR[rt] = 0 then 

StoreFPR(fd, fmt, ValueFPR(fs, fmt)) 
else 

StoreFPR(fd, fmt, ValueFPR(fd, fmt)) 


endif 


Exceptions: 


Coprocessor Unusable exception 
Reserved Instruction exception 


Floating-Point Exceptions: 


Unimplemented Operation exception 
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MSUB.fmt 


Floating-Point 
Multiply-Subtract 


Floating-Point Unit Instruction Set 


MSUB.fmt 


MSUB.fmt fd, fr, fs, ft 


Description: 


(MIPS IV format) 


31 26 25 2120 1615 1110 65 32 0 
COP1X fr ft fs fd MSUB fmt 
010011 101 
6 5 5 5 5 3 3 
Format: 


The value in floating-point register fs is multiplied by the value in floating-point 
register ft to produce a product. The value in floating-point register fr is subtracted 
from the product. The subtraction result is calculated to infinite precision, rounded 
according to the current rounding mode in the FCR31 register, and placed into 

floating-point register fd. The operands and result are values in format fmt. 


Cause bits are ORed into the Flag bits if no exception is taken. 


The fields fr, fs, ft, and fd must specify floating-point registers valid for operands 


of type fmt; if they are not valid, the result is undefined. 


The operands must be values in format fmt; if they are not, the result is undefined 
and the value of the operand floating-point registers becomes undefined. 


Operation: 


vir — ValueFPR(fr, fmt) 

vis <— ValueFPR(fs, fmt) 

vit — ValueFPR(ft, fmt) 

StoreFPR(fd, fmt, (vfs Xfm¢ Vit) fmt vir) 
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MSUB.fmt Pachuca MSUB.fmt 


Multiply-Subtract 
(continued) 


Exceptions: 


Coprocessor Unusable exception 
Reserved Instruction exception 


Floating-Point Exceptions: 


Unimplemented Operation exception 
Invalid Operation exception 
Overflow exception 

Underflow exception 

Inexact Operation exception 
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Move to FPU 
MTC1 (Coprocessor 1) MTC1 
31 26 25 21 20 16 15 11 10 0 
COP1 MT rt fs 0 
010001 00100 00000000000 
6 5 5 5 11 

Format: 
MTCI rt, fs (MIPS I format) 

Description: 
The contents of CPU general-purpose register rt are stored in the floating-point 
general-purpose register fs. 
The contents of floating-point register fs are undefined while the instruction 
immediately following this instruction is being executed. 
If the FR bit of the Status register is 0, only an even number can be specified as a 
register number because adjacent even-numbered and odd-numbered registers are 
used in pairs as floating-point registers. If an odd number is specified, the 
operation is undefined. If the FR bit is 1, both odd and even register numbers are 
valid. 
If the FR bit is 1, all of the 32 floating-point general-purpose registers can be 
accessed, but only the low-order 32 bits are affected by this instruction. 

Operation: 

32,64 T: data — GPR [rt]231 9 
T+1: if SRog= 1 then 

FGR [fs] < undefined?? || data 
else 
FGR [fs] < data 
endif 
Exceptions: 


Coprocessor Unusable exception 
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M U L.fmt Floating-Point Multiply M U L.fmt 


31 26 25 21 20 16 15 11 10 6 5 0 
COP1 fmt ft fs fd MUL 
010001 000010 
6 5 5 5 5 6 

Format: 
MUL.fmt fd, fs, ft (MIPS I format) 

Description: 
The contents of floating-point register fs are multiplied by those of floating-point 
register ft, and the result is stored in floating-point register fd. The operand is 
processed in the floating-point format fmt. 
This instruction is valid only for conversion from the single- or double-precision 
floating-point format. 
If the FR bit of the Status register is 0, only an even number can be specified as a 
register number because adjacent even-numbered and odd-numbered registers are 
used in pairs as floating-point registers. If an odd number is specified, the 
operation is undefined. If the FR bit is 1, both odd and even register numbers are 
valid. 

Operation: 

32,64T: StoreFPR (fd, fmt, ValueFPR (fs, fmt) * ValueFPR (ft, fmt) ) 

Exceptions: 
Coprocessor Unusable exception 
Floating-Point exception 

Floating-Point Exceptions: 
Unimplemented Operation exception 
Invalid Operation exception 
Inexact Operation exception 
Overflow exception 
Underflow exception 
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Floating-Point Unit Instruction Set 


NEG.fmt Floating-Point Negate NEG.fmt 


26 25 21 20 16 15 11.10 6 5 0 


fmt 0 fs fd NEG 
00000 000111 


5 5 5 5 6 


Format: 


NEG.fmt fd, fs (MIPS I format) 


Description: 


The sign of the contents of floating-point register fs is inverted and the result is 
stored in floating-point register fd. The operand is processed in the floating-point 
format fmt. 


The sign is inverted arithmetically. Therefore, the instruction is invalid if the 
operand is NaN. 


This instruction is valid only for conversion from the single- or double-precision 
floating-point format. 


If the FR bit of the Status register is 0, only an even number can be specified as a 
register number, because adjacent even-numbered and odd-numbered registers are 
used in pairs as floating-point registers. If an odd number is specified, the 
operation is undefined. If the FR bit is 1, both odd and even register numbers are 
valid. 


Operation: 


32,64T: | StoreFPR (fd, fmt, Negate (ValueFPR (fs, fmt) ) ) 


Exceptions: 


Coprocessor Unusable exception 
Floating-Point exception 


Floating-Point Exceptions: 


Unimplemented Operation exception 
Invalid Operation exception 
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NMADD.fmt dasvialey dt NMADD.fmt 


Negative 
Multiply-Add 
31 26 25 21 20 1615 11 10 65 32 0 
COP1X fr ft fs fd NMADD|_ fmt 
010011 110 
6 5 5 5 5 3 3 


Format: 
NMADD.fmt fd, fr, fs, ft (MIPS IV format) 


Description: 


The value in floating-point register fs is multiplied by the value in floating-point 
register ft to produce a product. The value in floating-point register fr is added to 
the product. The resulting sum is calculated to infinite precision, rounded 
according to the current rounding mode in the FCR31 register, negated by 
changing the sign bit, and placed into floating-point register fd. The operands and 
result are values in format fmt. 


Cause bits are ORed into the Flag bits if no exception is taken. 


The fields fr, fs, ft, and fd must specify floating-point registers valid for operands 
of type fmt; if they are not valid, the result is undefined. 


The operands must be values in format fmt; if they are not, the result is undefined 
and the value of the operand floating-point registers becomes undefined. 


Operation: 


vir — ValueFPR(fr, fmt) 

vfs <— ValueFPR(fs, fmt) 

vft <— ValueFPR(ft, fmt) 

StoreFPR(fd, fmt, —(vfr time (VES Xtmet vft))) 


648 Vr5432 Microprocessor User’s Manual 


Floating-Point Unit Instruction Set 


NMADD.fmt dasa da NMADD.fmt 


Negative 
Multiply-Add 


(continued) 


Exceptions: 


Coprocessor Unusable exception 
Reserved Instruction exception 


Floating-Point Exceptions: 


Unimplemented Operation exception 
Invalid Operation exception 
Overflow exception 

Underflow exception 

Inexact Operation exception 
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NMSUB.fmt hop aclhr c NMSUB.fmt 


Negative 
Multiply-Subtract 


31 26 25 21 20 1615 11 10 65 32 0 
COP1X fr ft fs fd NMSUB| _ fmt 
010011 111 
6 5 5 5 5 3 3 
Format: 
NMSUB.fmt fd, fr, fs, ft (MIPS IV format) 
Description: 


The value in floating-point register fs is multiplied by the value in floating-point 
register ft to produce a product. The value in floating-point register fr is subtracted 
from the product. The subtraction result is calculated to infinite precision, rounded 
according to the current rounding mode in the FCR31 register, negated by 
changing the sign bit, and placed into floating-point register fd. The operands and 
result are values in format fmt. 


Cause bits are ORed into the Flag bits if no exception is taken. 


The fields fr, fs, ft, and fd must specify floating-point registers valid for operands 
of type fmt; if they are not valid, the result is undefined. 


The operands must be values in format fmt; if they are not, the result is undefined 
and the value of the operand floating-point registers becomes undefined. 


Operation: 


vir — ValueFPR(fr, fmt) 

vis <— ValueFPR(fs, fmt) 

vit — ValueFPR(ft, fmt) 

StoreFPR(fd, fmt, —((vfs Xfmt Vft) fmt vfr)) 
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NMSUB.fmt or sal cht NMSUB.fmt 


Negative 
Multiply-Subtract 


(continued) 


Exceptions: 


Coprocessor Unusable exception 
Reserved Instruction exception 


Floating-Point Exceptions: 


Unimplemented Operation exception 
Invalid Operation exception 
Overflow exception 

Underflow exception 

Inexact Operation exception 
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P R E FX Prefetch Indexed =) R E FX 


31 


26 25 21 20 1615 11 10 65 0 


base index hint 0 PREFX 
00000 001111 


5 5 5 5 6 
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Format: 


PREFX hint, index (base) (MIPS IV format) 


Description: 


PREFX adds the contents of general-purpose register index to the contents of 
general-purpose register base to form an effective byte address. It presents advice 
that data at the effective address may be used in the near future. The hint field 
supplies information about the way the data is expected to be used. 


Unlike the Vr5000, in which the PREFX instruction is executed as an NOP, in the 
VR5432 data may be prefetched into the data cache as a result of executing this 
instruction. 


PREFX is an advisory instruction that may change the performance of the 
program. For all hint values, it neither changes architecturally visible state nor 
alters the meaning of the program. The supported hint values are shown in Table 
18-8. 
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PREFX 


Floating-Point Unit Instruction Set 


Prefetch Indexed Pp R E FX 


(continued) 


Table 18-8 Hint Field Values Used in PREFX Instruction 


Value 


Name 


Data Use and Desired Prefetch Action 


load 


Data is expected to be loaded (not modified). 
Fetch data as if for a load. 


store 


Data is expected to be stored or modified. 
Fetch data as if for a store. 


2-3 


Reserved 


load_streamed 


Data is expected to be loaded (not modified) but not reused 
extensively; it “streams” through the cache. 

Fetch data as if for a load and place it in the cache so that 
it does not displace data prefetched as “retained.” 


store_streamed 


Data is expected to be stored or modified but not reused 
extensively; it “streams” through the cache. 

Fetch data as if for a store and place it in the cache so that 
it does not displace data prefetched as “retained.” 


load_retained 


Data is expected to be loaded (not modified) and reused 
extensively; it should be “retained” in the cache. 

Fetch data as if for a load and place it in the cache so that 
it is not displaced by data prefetched as “streamed.” 


store_retained 


Data is expected to be stored or modified and reused 
extensively; it should be “retained” in the cache. 

Fetch data as if for a store and place it in the cache so that 
it is not displaced by data prefetched as “streamed.” 


Reserved 


writeback_invalidate 


26-31 


Reserved 
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p R E FX Prefetch Indexed p R E FX 


(continued) 


If MIPS IV instructions are supported and enabled and Coprocessor | is enabled 
(allowing access to CP1X), PREFX does not cause any addressing-related 
exceptions. If it does raise a nonaddressing-related exception condition, the 
exception condition is ignored. If an addressing-related exception condition is 
raised and ignored, no data is prefetched. In such a case, even if no data is 
prefetched, some action that is not architecturally visible—such as write-back of 
a dirty cache line—can take place. 


PREFX never generates a memory operation for a location with an uncached 
memory access type. However, it can result in a memory operation. 


The Region bits of the effective address must be supplied by the contents of base. 
If EffectiveAddress¢3 ¢ # basé¢3 6, the result of the instruction is undefined. 


Prefetch cannot prefetch data from a mapped location unless the translation for 
that location is present in the TLB. Locations in memory pages that have not been 
accessed recently may not have translations in the TLB, so prefetch may not be 
effective for such locations. 


Prefetch does not cause addressing exceptions. It does not cause an exception to 
prefetch using an address pointer value before the validity of a pointer is 
determined. 


Operation: 


vAddr «+ GPR[base] + GPR[index] 
(pAddr, CCA) <— AddressTranslation(vAddr, DATA, LOAD) 
Prefetch(CCA, pAddr, vAddr, DATA, hint) 


Exceptions: 


Reserved Instruction exception 
Coprocessor Unusable exception 
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Floating-Point Unit Instruction Set 


fmt Reciprocal RECIP.fmt 


31 26 25 21 20 1615 1110 65 0 
COP1 fmt 0 fs fd RECIP 
010001 00000 010101 
6 5 5 5 5 6 
Format: 
RECIP.fmt fd, fs (MIPS IV format) 
Description: 


The reciprocal of the value in floating-point register fs is placed into floating-point 
register fd. The operand and result are values in format fmt. 


The numeric accuracy of this operation meets the full accuracy specified by the 
TEEE-754 floating-point standard for this operation. 


The fields fs and fd must specify floating-point registers valid for operands of type 
jmt; if they are not valid, the result is undefined. 


Operation: 


StoreFPR(fd, fmt, 1.0 / valueFPR(fs, fmt)) 


Exceptions: 


Coprocessor Unusable exception 
Reserved Instruction exception 


Floating-Point Exceptions: 


Unimplemented Operation exception 
Invalid Operation exception 
Overflow exception 

Underflow exception 

Inexact Operation exception 
Division by Zero exception 
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ROUND.L.fmt = fomagieng ROUND.L.fmt 


Fixed-Point Format 


26 25 21 20 16 15 
0 ROUND.L 
00000 001000 
5 6 
Format: 
ROUND.L.fmt fd, fs (MIPS III format) 
Description: 


The contents of floating-point register fs are converted into the 64-bit fixed-point 
format and the result is stored in floating-point register fd. The source operand is 
processed in the floating-point format fmt. 


The result of the conversion is rounded to the closest value or even number, 
regardless of the current rounding mode. 


This instruction is valid only for conversion from the single- or double-precision 
floating-point format. 


If the FR bit of the Status register is 0, only an even number can be specified as a 
register number, because adjacent even-numbered and odd-numbered registers are 
used in pairs as floating-point registers. If an odd number is specified, the 
operation is undefined. If the FR bit is 1, both odd and even register numbers are 
valid. 


If the source operand is infinite or NaN and if the rounded result is outside the 
range of 29 ty 2°? = 1; ‘the Unimplemented Operation exception occurs. If the 
Unimplemented Operation exception is not enabled, the exception does not occur 
and 2°? — 1 is returned. 


This operation is defined in 64-bit mode and 32-bit Kernel mode. If this 
instruction is executed during 32-bit User or Supervisor mode, a Reserved 
Instruction exception occurs. 
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ROUND.L.fmt .oce,, ROUND.L.fmt 


Round to Long 
Fixed-Point Format 
(continued) 


Operation: 


64 TT:  StoreFPR (fd, L, ConvertFmt (ValueFPR (fs, fmt) , fmt, L) ) 


Note: Same operation in 32-bit Kernel mode. 


Exceptions: 


Coprocessor Unusable exception 
Floating-Point exception 
Reserved Instruction exception 


Floating-Point Exceptions: 


Unimplemented Operation exception 
Inexact Operation exception 
Overflow exception 
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ROUND.W.fmt = Freatins-Pointt ROUND.W.fmt 


Round to Single 
Fixed-Point Format 


31 


26 25 21 20 16 15 11 10 6 5 0 


COP1 
010001 


fmt 0 fs fd ROUND.W 
00000 001100 


6 


5 5 5 5 6 
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Format: 


ROUND.W.fmt fd, fs (MIPS II format) 


Description: 


The contents of floating-point register fs are converted into the 32-bit fixed-point 
format and the result is stored in floating-point register fd. The source operand is 
processed in the floating-point format fmt. 


The result of the conversion is rounded to the closest value or even number, 
regardless of the current rounding mode. 


This instruction is valid only for conversion from the single- or double-precision 
floating-point format. 


If the FR bit of the Status register is 0, only an even number can be specified as a 
register number because adjacent even-numbered and odd-numbered registers are 
used in pairs as floating-point registers. If an odd number is specified, the 
operation is undefined. If the FR bit is 1, both odd and even register numbers are 
valid. 


If the source operand is infinite or NaN and if the rounded result is outside the 
range of 23! 1 to -23!, the Invalid Operation exception occurs. If the Invalid 
Operation exception is not enabled, the exception does not occur and Oo) sais 
returned. 
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ROUND.W.fmt  Freating-Point ~ROUND.W.fmt 


Round to Single 
Fixed-Point Format 
(continued) 


Operation: 


32,64T: | StoreFPR (fd, W, ConvertFmt (ValueFPR (fs, fmt) , fmt, W) ) 


Exceptions: 
Coprocessor Unusable exception 
Floating-Point exception 
Floating-Point Exceptions: 


Invalid Operation exception 
Unimplemented Operation exception 
Inexact Operation exception 
Overflow exception 
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RSQRT.fmt nha, RSQRT.fmt 


31 26 25 21 20 1615 11 10 65 0 
COP1 fmt 0 fs fd RSQRT 
010001 00000 010110 
6 5 5 5 5 6 

Format: 
RSQRT.fmt fd, fs (MIPS IV format) 

Description: 
The reciprocal of the positive square root of the value in floating-point register fs 
is placed into floating-point register fd. The operand and result are values in 
format fmt. 
The numeric accuracy of this operation meets the full accuracy specified by the 
IEEE-754 floating-point standard for this operation. 
The fields fs and fd must specify floating-point registers valid for operands of type 
jmt; if they are not valid, the result is undefined. 

Operation: 

StoreFPR(fd, fmt, 1.0 / SquareRoot(valueFPR(fs, fmt))) 

Exceptions: 
Coprocessor Unusable exception 
Reserved Instruction exception 

Floating-Point Exceptions: 
Unimplemented Operation exception 
Invalid Operation exception 
Overflow exception 
Underflow exception 
Inexact Operation exception 
Division by Zero exception 
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Store Doubleword from FPU 
SDC1 (Coprocessor 1) SDC1 
31 26 25 21 20 16 15 0 
SDC1 base ft offset 
111101 
6 5 5 16 

Format: 

SDC1 ft, offset (base) (MIPS II format) 
Description: 


The 16-bit offset is sign extended and added to the contents of general-purpose 
register base to form a virtual address. 


The contents of floating-point registers ft and ft + / are stored in the memory 
position specified by the virtual address as a doubleword if the FR bit of the Status 
register is 0. At this time, the contents of the odd-numbered register specified by 
jt + 1 correspond to the high-order 32 bits of the doubleword and the contents of 
the even-numbered register specified by ft correspond to the low-order 32 bits. 


If the least-significant bit in the ft field is not 0, this instruction is not defined. 


If the FR bit is 1, the contents of floating-point register ft are stored in the memory 
location specified by the virtual address as a doubleword. 


If any of the low-order three bits of the address is not zero, an Address Error 
exception occurs. 
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SDC1 


Store Doubleword from FPU 
(Coprocessor 1) S DC1 


(continued) 


Operation: 


32 


64 


T: 


vAddr < ( (offset;s)'® || offset;s_ 09) + GPR [base] 
(pAddr, uncached) < AddressTranslation (vAddr, DATA) 
if SRog = 1 

data <— FGR [ftle3...0 
elseif ftg = 0 then 

data < FGR [ft+1]31_0 || FGR [ft]31..0 
else 

data < undefine 
endif 
StoreMemory (uncached, DOUBLEWORD, data, pAddr, vAddr, DATA) 


d°4 


T: vAddr < ( (offset;s)® || offset;s__0) + GPR [base] 


(pAddr, uncached) < AddressTranslation (vAddr, DATA) 
if SRog = 1 

data <— FGR [ftle3...0 
elseif ftg = 0 then 

data < FGR [ft+1]31..0 || FGR [ftl31...0 
else 

data < undefine 
endif 
StoreMemory (uncached, DOUBLEWORD, data, pAddr, vAddr, DATA) 


64 
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Exceptions: 


Coprocessor Unusable exception 
TLB Miss exception 

TLB Invalid exception 

TLB Modification exception 
Bus Error exception 

Address Error exception 
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S DXC1 Store Doubleword Indexed from FPU S DXC1 


(Coprocessor 1) 


31 26 25 21 20 1615 1110 65 0 
COP1X base index fs 0 SDXC1 
010011 00000 000001 
6 5 5 5 5 6 

Format: 
SDXC1 fs, index (base) (MIPS IV format) 
Description: 


The 64-bit doubleword in floating-point register fs is stored in memory at the 
location specified by the aligned effective address. The contents of general- 
purpose registers index and base are added to form the effective address. 


The Region bits of the effective address must be supplied by the contents of base. 
If EffectiveAddress¢3 — # base¢3 62, the result is undefined. 


An Address Error exception occurs if EffectiveAddress, 9 # 0 (not doubleword- 
aligned). If they are not, the result of the instruction is undefined. 


Operation: 


vAddr < GPRI[base] + GPR[index] 
if vAddro 9 # 03 then SignalException(AddressError) endif 
(pAddr, CCA) < AddressTranslation(vAddr, DATA, STORE) 
if FP32RegistersMode then 

data < FPRi[fs] 


else 
if fsg = 0 then 
data < FPR[fs4 ; || 0] 
else 
data < undefined®* 
endif 
endif 


StoreMemory(CCA, DOUBLEWORD, data, pAddr, vAddr, DATA) 
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SDXC1 Store Doubleword Indexed from FPU SDXC1 


(Coprocessor 1) 
(continued) 


Exceptions: 


Coprocessor Unusable exception 
Reserved Instruction exception 
Address Error exception 

TLB Refill exception 

TLB Modified exception 

TLB Invalid exception 
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SQRT.fmt SaueSREGe SQRT.fmt 


31 


26 25 21 20 16 15 11.10 6 5 0 


COP1 
010001 


fmt 0) fs fd SQRT 
00000 000100 


6 


5 5 5 5 6 


Format: 


SQRT.fmt fd, fs (MIPS II format) 


Description: 


The positive arithmetic square root of the contents of floating-point register fs is 
calculated and the result is stored in floating-point register fd. The operand is 
processed in the floating-point format fmt. The result is rounded as if calculated to 
infinite precision and then rounded according to the current rounding mode. If the 
value of the source operand is —0, the result will be —0. The result is placed in the 
floating-point register specified by fd. 


This instruction is valid only for conversion from the single- or double-precision 
floating-point format. 


If the FR bit of the Status register is 0, only an even number can be specified as a 
register number, because adjacent even-numbered and odd-numbered registers are 
used in pairs as floating-point registers. If an odd number is specified, the 
operation is undefined. If the FR bit is 1, both odd and even register numbers are 
valid. 


Operation: 


32,64T: | StoreFPR (fd, fmt, SquareRoot (ValueFPR (fs, fmt) ) ) 


Exceptions: 


Coprocessor Unusable exception 
Floating-Point exception 


Floating-Point Exceptions: 


Unimplemented Operation exception 
Invalid Operation exception 
Inexact Operation exception 
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S U B -fmt Floating-Point Subtract S U B -fmt 


31 26 25 21 20 16 15 11 10 6 5 0 
COP1 fmt ft fs fd SUB 
010001 000001 
6 5 5 5 5 6 
Format: 
SUB.fmt fd, fs, ft (MIPS I format) 
Description: 
The contents of floating-point register ft are subtracted from those of floating- 
point register fs, and the result is stored in floating-point register fd. The result is 
rounded as if calculated to infinite precision and then rounded according to the 
current rounding mode. 
This instruction is valid only for conversion from the single- or double-precision 
floating-point format. 
If the FR bit of the Status register is 0, only an even number can be specified as a 
register number, because adjacent even-numbered and odd-numbered registers are 
used in pairs as floating-point registers. If an odd number is specified, the 
operation is undefined. If the FR bit is 1, both odd and even register numbers are 
valid. 
Operation: 
32,64T: | StoreFPR (fd, fmt, ValueFPR (fs, fmt) — ValueFPR (ft, fmt) ) 
Exceptions: 
Coprocessor Unusable exception 
Floating-Point exception 
Floating-Point Exceptions: 
Unimplemented Operation exception 
Invalid Operation exception 
Inexact Operation exception 
Overflow exception 
Underflow exception 
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Store Word from FPU 
SWC1 (Coprocessor 1) SWC1 


31 26 25 21 20 16 15 0 
SWC1 base ft offset 
111001 
6 5 5 16 
Format: 
SWC ft, offset (base) (MIPS I format) 
Description: 


The 16-bit offset is sign extended and added to the contents of general-purpose 
register base to form a virtual address. The contents of the floating-point general- 
purpose register ft are stored in the memory location at the specified address. 


If the FR bit of the Status register is 0 and the least-significant bit in the ft field is 
0, the contents of the low-order 32 bits of floating-point register ft are stored in 
memory. If the least-significant bit in the ft field is 1, the contents of the high-order 
32 bits of floating-point register ft — / are stored. 


If the FR bit is 1, all of the 64-bit floating-point registers can be accessed. The 
contents of the low-order 32 bits of the register in the ft field are stored in memory. 


If either of the low-order two bits of the address is not zero, an Address Error 
exception occurs. 
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SWC1 a coneensaat SWC1 


(continued) 


Operation: 


32 T: vAddr < ( (offsetys)'® || offsetys_ ) + GPR[base] 
(pAddr, uncached) < AddressTranslation (vAddr, DATA) 
data < FGR [ft]31..0 
StoreMemory (uncached, WORD, data, pAddr, vAddr, DATA) 


64 Tr vAddr < ( (offsetys)*° || offsetys_ ) + GPR[base] 
(pAddr, uncached) < AddressTranslation (vAddr, DATA) 
data <— FGR [ft]31..0 
StoreMemory (uncached, WORD, data, pAddr, vAddr, DATA) 


Exceptions: 


Coprocessor Unusable exception 
TLB Miss exception 

TLB Invalid exception 

TLB Modified exception 

Bus Error exception 

Address Error exception 


668 Vr5432 Microprocessor User’s Manual 


Floating-Point Unit Instruction Set 


SWXC1 Store Word Indexed from FPU SWXC1 


(Coprocessor 1) 


31 26 25 21 20 1615 1110 65 0 
COP1X base index fs 0 SWXC1 
010011 00000 001000 
6 5 5 5 5 6 
Format: 
SWXCI fs, index (base) (MIPS IV format) 
Description: 


The low 32-bit word from floating-point regis t efs is stored in memory at the 
location specified by the aligned effective address. The contents of general- 
purpose registers index and base are added to form the effective address. 


The Region bits of the effective address must be supplied by the contents of base. 
If EffectiveAddress¢3 — # base¢3 62, the result is undefined. 


An Address Error exception occurs if EffectiveAddress, 9 #0 (not word aligned). 
If they are not, the result of the instruction is undefined. 
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SWXC1 Store Word Indexed from FPU SWXC1 


(Coprocessor 1) 
(continued) 


Operation: 


vAddr < GPRI[base] + GPR[index] 
if vAddry 9 # 07 then SignalException(AddressError) endif 
(pAddr, CCA) < AddressTranslation(vAddr, DATA, STORE) 
pAddr <— pAddrpgize-1..3 || (PAddre 9 xor (ReverseEndian || 0°)) 
bytesel < vAddro 9 xor (BigEndianCPU || 0°) 
/* the bytes of the word are moved into the correct byte lanes */ 
if FP32RegistersMode then 

data <— FPR[fs]31.0 


else 
if fsg = O then 
data <— FPR[fs4_4 || Ols1..0 
else 
data < FPR[fsq._4 || Oles..s2 
endif 
endif 


StoreMemory (CCA, WORD, data, pAddr, vAddr, DATA) 
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Exceptions: 


Coprocessor Unusable exception 
Reserved Instruction exception 
Address Error exception 

TLB Refill exception 

TLB Modified exception 

TLB Invalid exception 
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TRUNC.L.fmt reectong 4 RUNC.L.fmt 


Truncate to Long 


Fixed-Point Format 


26 25 21 20 16 15 
0 TRUNC.L 
00000 001001 
5 6 
Format: 
TRUNC.L.fmt fd, fs (MIPS III format) 
Description: 


The contents of floating-point register fs are converted into the 64-bit fixed-point 
format and the result is stored in floating-point register fd. The source operand is 
processed in the floating-point format fmt. 


The result of the conversion is rounded toward 0, regardless of the current 
rounding mode. 


This instruction is valid only for conversion from the single- or double-precision 
floating-point format. 


If the FR bit of the Status register is 0, only an even number can be specified as a 
register number, because adjacent even-numbered and odd-numbered registers are 
used in pairs as floating-point registers. If an odd number is specified, the 
operation is undefined. If the FR bit is 1, both odd and even register numbers are 
valid. 


If the source operand is infinite or NaN and if the rounded result is outside the 
range of 29 ty 2°? = 1; ‘the Unimplemented Operation exception occurs. If the 
Unimplemented Operation exception is not enabled, the exception does not occur 
and 2°? — 1 is returned. 


This operation is defined in 64-bit mode and 32-bit Kernel mode. If this 
instruction is executed during 32-bit User or Supervisor mode, a Reserved 
Instruction exception occurs. 
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TRUNC.L.fmt re, TRUNC.L.fmt 


Truncate to Long 
Fixed-Point Format 
(continued) 


Operation: 


64 TT:  StoreFPR (fd, L, ConvertFmt (ValueFPR (fs, fmt) , fmt, L) ) 


Note: Same operation in 32-bit Kernel mode. 


Exceptions: 


Coprocessor Unusable exception 
Floating-Point exception 
Reserved Instruction exception 


Floating-Point Exceptions: 


Unimplemented Operation exception 
Inexact Operation exception 
Overflow exception 
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TRUNC.W.fmt ore. «© PRUNC.W.fmt 


Truncate to 
Single Fixed-Point Format 


31 26 25 21 20 16 15 11 10 6 5 0 
COP1 fmt 0 fs fd TRUNC.W 
010001 00000 001101 
6 5 5 5 5 6 
Format: 
TRUNC.W.fmt fd, fs (MIPS II format) 
Description: 


The contents of floating-point register fs are arithmetically converted into a 32-bit 
fixed-point single format, and the result is stored in floating-point register fd. The 
source operand is processed in the floating-point format fmt. 


The result of the conversion is rounded toward 0, regardless of the current 
rounding mode. 


This instruction is valid only for conversion from the single- or double-precision 
floating-point format. 


If the FR bit of the Status register is 0, only an even number can be specified as a 
register number because adjacent even-numbered and odd-numbered registers are 
used in pairs as floating-point registers. If an odd number is specified, the 
operation is undefined. If the FR bit is 1, both odd and even register numbers are 
valid. 


If the source operand is infinite or NaN and if the rounded result is outside the 
range of 23! _ 1 to -23!, the Invalid Operation exception occurs. If the Invalid 
Operation exception is not enabled, the exception does not occur and go Vig 
returned. 
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TRUNC.W.fmt = Freating- Point “TRUNC.W.fmt 


Truncate to 
Single Fixed-Point Format 
(continued) 


Operation: 


32, 64T: 


StoreFPR (fd, W, ConvertFmt (ValueFPR (fs, fmt) , fmt, W) ) 


18.4 
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Exceptions: 


Coprocessor Unusable exception 
Floating-Point exception 


Floating-Point Exceptions: 


Invalid Operation exception 
Unimplemented Operation exception 
Inexact Operation exception 
Overflow exception 


FPU Instruction Opcode Bit Encoding 


Figure 18-3 lists the bit encoding for FPU instructions. 
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Opcode 
28...26 

31...29 0 1 2 3 4 5 6 7 
0 
1 
2 COP 
3 
4 
5 
6 LWC1 LDC1 
7 SWC1 SDC1 

sub 
23...21 

25.94 0 1 2 3 4 5 6 7 
0 MF DMFh CF g MT DMTh CT g 
1 BC g g g g g g g 
2 S D g g W Lh g g 
3 g g g g g g g g 

18...16 br 

20.19 0 1 2 3 4 5 6 7 
0 BCF BCT BCFL | BCTL * * * * 
1 * * * * * * * * 
2 * * * * * * * * 
3 * * * * * * * * 


Figure 18-3 Bit Encoding for FPU Instructions (1 of 2) 
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2.0 function 
5...3 0 1 2 3 4 5 6 7 
0 ADD SUB | MUL DIV  SQRT | ABS | MOV | NEG 
1 ROUND.LT| | TRUNC.L1N}} CEIL.LT) FLOOR.LN] | ROUND.W | TRUNC.W CEIL.W FLOOR.W/RECIP 
2 Y Y iv iy Y Y RSQRT| Y 
3 Y Y Y Y Y Y Y Y 
4 | CVT.S | CVT.D Y Y CVT.W | CVT.Ly y Y 
5 Y Y ¥ Y Y Y Y Y 
6 C.F C.UN | C.EQ | C.UEQ | C.OLT | C.ULT | C.OLE | C.ULE 
Z C.SF | C.NGLE] C.SEQ | C.NGL | C.LT | C.NGE | C.LE_ | C.NGT 


Figure 18-4 Bit Encoding for FPU Instructions (2 of 2) 


Key: 


* 
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When an opcode marked with an asterisk is executed, the 
Reserved Instruction exception occurs. These codes are reserved 
for future expansion. 


Opcodes marked with a gamma cause an Unimplemente 
Operation exception in all current implementations and ar 
reserved for future expansion 


Opcodes marked with an eta are only defined when use of the 
MIPS ITI instruction set is enabled. If the opcode is executed when 
use of the instruction set is disabled (i.e., in 32-bit User or 
Supervisor mode), the Unimplemented Operation exception 
occurs. 
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This chapter provides a detailed description of the multimedia instructions. (For 
an general overview of Vr5432 instructions, see Chapter 16.) 


19.1 Multimedia Extensions 


The VR5432 implements instructions and architectural extensions to support high- 
performance multimedia applications. These instructions interpret the 64-bit 
floating-point registers as packed vectors of eight unsigned 8-bit integers, called 
the octal byte or OB format. Considerable efficiency can be gained by operating 
in parallel on data, such as image data, in its original format rather than promoting 
it to larger integer formats. All of these instructions have a two-cycle latency and 
a one-cycle repeat rate. 
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Three types of vector operations are supported: 


¢ Vector-Vector. Each element of source vector vs is operated against 
the corresponding elements of source vector vt to produce destination 
vector vd, as shown in Figure 19-1. 


¢ Vector-Scalar. Each element of source vector vs is operated against a 
selected element of source vector vt to produce destination vecto vd, 
as shown in Figure 19-2 


* Vector-Immediate: Each element of source vector vs is operated 
against an immediate value to produce destination vecto vd, as 
shown in Figure 19-3. 


63 56 55 48 47 40 39 32.31 24 23. 16 15 87 0 
vi[7] vi[6] vt[5] 


32 31¥ ¥24 23¥ 
vd[3]_ | vd[2] 


vd[7] vd[6] vd[5] | vd[4] vd[1] vd[0} 


Figure 19-1 Vector-Vector Operation 


63 56 55 48 47 40 39 32.31 24 23 16 15 


87 0 


63 | 56 55 48 47 40 39 32 31} 24 23| 16 15 87 | 0 


56 55 ¥ ¥4847 ¥ ¥40 39 ¥32 31¥ V24 23Y 
vd{7] vd[6] vd[5] vd[4] | vd[3] | vd[2] vd{1] | vdfo} 


Figure 19-2 Vector-Scalar Operation 
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63 56 55 48 47 40 39 32.31 24 23 16 15 87 0 
vs[7] | vs[6] | vs[5] | vs[4] vs[3] vs[2] vs[1] vs[0] 


imm 


vd{7] ro vd[5] vd[4] | va[3] | vd(2] | vat] | vafoy 


Figure 19-3 Vector-Immediate Operation 


The type of vector operation is selected by a field in the instruction. The four-bit 
sel field selects the treatment of the vt operand field, as described in Table 19-1. 
When a vector-immediate operation is selected, the value of the immediate is 
taken from the vt operand field. 


Table 19-1 sel Field Encoding 


Bit Encoding Description 

0000 Vector-scalar operation; vt[0] is the source operand. 
0001 Vector-scalar operation; vt[1] is the source operand. 
0010 Vector-scalar operation; vt[2] is the source operand. 
0011 Vector-scalar operation; vt[3] is the source operand. 
0100 Vector-scalar operation; vt[4] is the source operand. 
0101 Vector-scalar operation; vt[5] is the source operand. 
0110 Vector-scalar operation; vt[6] is the source operand. 
O111 Vector-scalar operation; vt[7] is the source operand. 
1011 Vector-vector operation 

1111 Vector-immediate operation 


Vector arithmetic operations (except for multiply-accumulate and shift) are 
saturating; i.e., results that overflow or underflow are clamped to the largest or 
smallest representable values (255 and 0, respectively). No exceptions occur as a 
result of overflow or underflow. 


Vector operations can also be performed using the 192-bit Vector Accumulator as 
the destination. This register is interpreted as eight 24-bit accumulators, which is 
sometimes referred to as the OB format because it is only operated upon by data 
in the octal byte format. As with many DSP architectures, having an accumulator 
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wider than the operand data, shown in Figure 19-4, allows a series of operations 


to be performed without concern about overflow or accumulation of round-off 
error. 


Multiplier 


[a 


23 0 
Vector Accumulator [i] 


Adder 


Figure 19-4 24-Bit Accumulator 


680 Vr5432 Microprocessor User’s Manual 


Multimedia Instruction Set 


19.2 Multimedia Instruction Format 


A basic set of instructions to perform arithmetic and logical operations between 
registers is provided. In addition, instructions exist for handling unaligned data, 
permutations, comparisons, and conditional selection. For data movement, the 
standard FPU instruction set is used. The R-type format used by the multimedia 
instructions is shown in Figure 19-5. Some instructions do not require all fields, 
in which case they are sometimes used to provide additional function selection 
bits. The ALNI instruction has a unique interpretation of bits 21 through 25, not 
described by this figure. 


R-type (Register) 


31 26 25 22 21 20 16 (15 11.10 6 5 0 


function 


MEDIA: 6-bit opcode 
sel: 4-bit vector operation specifier or immediate value 
vs: 5-bit source 1 register 
vt: 5-bit source 2 register 
vd: 5-bit destination register 


function: 6-bit function field 


Figure 19-5 Multimedia Instruction Format 
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19.3 Multimedia Instructions 
Table 19-2 lists the multimedia instructions sorted by function field. 
Table 19-2 Multimedia Instructions and Operations 
rs Mnemonic Operation 
1 C.EQ.OB Vector Compare Equal 
2 PICKF.OB Vector Pick False 
3 PICKT.OB Vector Pick True 
4 C.LT.OB Vector Compare Less Than 
BS) C.LE.OB Vector Compare Less Than or Equal 
6 MIN.OB Vector Minimum 
7 MAX.OB Vector Maximum 
10 SUB.OB Vector Subtract 
11 ADD.OB Vector ADD 
12 AND.OB Vector AND 
13 XOR.OB Vector XOR 
14 OR.OB Vector OR 
15 NOR.OB Vector NOR 
16 SLL.OB Vector Shift Left Logical 
18 SRL.OB Vector Shift Right Logical 
24 ALNI.OB Vector Align 
31,sel=4 |SHFL.PACH.OB | Vector Element Shuffle 
31,sel=5 |SHFL.PACL.OB | Vector Element Shuffle 
31,sel=6 |SHFL.MIXH.OB | Vector Element Shuffle 
31,sel=7 |SHFL.MIXL.OB | Vector Element Shuffle 
32 RZU.OB Vector Scale, Round, and Clamp Accumulator 
48 MUL.OB Vector Multiply 
50,vd=0 |MULS.OB Vector Multiply and Subtract Accumulator 
50, vd= 16 |MULSL.OB Vector Multiply, Subtract, and Load Accumulator 
51,vd=0 |MULA.OB Vector Multiply-Accumulate 
51, vd=16 |MULL.OB Vector Multiply and Load Accumulator 
62,sel=0 |WACL.OB Vector Write Accumulator Low 
62,sel=8 |WACH.OB Vector Write Accumulator High 
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Table 19-2 Multimedia Instructions and Operations (continued) 


ei Mnemonic Operation 
63,sel=0 |RACL.OB Vector Read Accumulator Low 
63,sel=4 |RACM.OB Vector Read Accumulator Middle 
63,sel=8 |RACH.OB Vector Read Accumulator High 
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ADD.OB Vector Add ADD.OB 
31 26 25 22 2120 1615 1110 65 0 
MEDIA sel 0 vt VS vd ADD 
010010 0 001011 
6 4 1 5 5 5 6 
Format: 


ADD.OB vd, vs, vt 


Description: 


684 


The values in vector vt are added to the values in vector vs. Saturated arithmetic 
is performed: overflows and underflows clamp to the largest or smallest 

representable value before writing to vector vd. The sel field selects the values of 
vt[] used for each i. 


No data-dependent exceptions are possible. The operands must be values in the 
OB format. If they are not, the results are undefined and the values of the operand 
vectors become undefined. The result of this instruction is undefined if the 
processor is executing in 16 FP register mode. 
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(continued) 


Operation: 


Multimedia Instruction Set 


ADD.OB 


ts <— FPR[vs] 
tt — select(sel, vt) 
FPR[vd] < AddOB(ts¢3. 56, ttes..56) 
|| AddOB(ts55..48, ttss..48) 
|| AddOB(ts47..40, tt47..40) 
|| AddOB(ts39. 32, ttgg..32) 
|| AddOB(ts31_ 24, tt31_ 24) 
|| AddOB(tso3. 16, tto3, 16) 
|| AddOB(ts15. 8; tt1s..8) 
|| AddOB(ts7..o, tt7..o) 
function AddOB(ts, tt) 
t < (0 || ts) + (0 || tt) 
if tg = 1 then 
AddOB < 18 
else 
AddOB <- t7.0 
endif 
end AddOB 


Exceptions: 


Coprocessor Unusable exception 
Reserved Instruction exception 
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ALNI.OB eat ALNI.OB 


Constant Alignment 


31 26 2524 23 2120 1615 1110 65 0 
MEDIA 0 Imm vt vs vd ALNI 
010010 00 011000 
6 2 3 5 5 5 6 
Format: 


ALNI.OB vd, vs, vt, imm 


Description: 


The align amount is computed by masking the immediate, then using that value to 
control a funnel shift of vector vs concatenated with vector vt. No immediate or 
scalar mode is available. 


No data-dependent exceptions are possible. The operands must be values in OB 
format. If they are not, the results are undefined and the values of the operand 
vectors become undefined. This operation does not interpret the format of the 
registers specified. The result of this instruction is undefined if the processor is 
executing in 16 FP register mode. 


Operation: 
sc immo ol|0° 
if BigEndianCPU then 
FPR[vd] <— (FPR[vs] || FPR[vtl)127-s..64-s 
else 
FPR[vd] < (FPR[vs] || FPR[vt])63.s.. 
endif 
Exceptions: 


Coprocessor Unusable exception 
Reserved Instruction exception 
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AND.OB Vector AND AND.OB 
31 26 25 22 21 20 1615 1110 65 0 
MEDIA sel 0 vt VS vd AND 
010010 0 001100 
6 4 1 5 5 5 6 
Format: 


AND.OB vd, vs, vt 


Description: 


Each element of vector vs is combined with the corresponding element of vector 
vt in a bitwise logical AND operation. The se/ field selects the values of v¢[] used 


for each i. 


No data-dependent exceptions are possible. The operands must be values in the 

OB format. If they are not, the results are undefined and the values of the operand 
vectors become undefined. The result of this instruction is undefined if the 
processor is executing in 16 FP register mode. 
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AND.OB Niantic AND.OB 


(continued) 


Operation: 


ts <— FPR[vs] 
tt — select(sel, vt) 
FPR[vd] <— AndOB(ts¢3. 56, ttes..s6) 

|| AndOB(tsss..48, ttss..48) 

|| AndOB(ts47..40, tt47..40) 
|| AndOB(tsgg_ 32, ttsg..32) 
|| AndOB(ts31_ 24, tts1_.24) 

|| AndOB(tse3. 16, tto3. 16) 

|| AndOB(ts15. 9, tts..s) 

|| AndOB(ts7..o, tt7..) 
function AndOB(ts, tt) 

AndOB € (0 || ts) and (0 || tt) 
end AndOB 
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Exceptions: 


Coprocessor Unusable exception 
Reserved Instruction exception 
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CG. EQ.OB Vector Compare (Equal) Cc: EQ.OB 


31 26 25 22 21 20 1615 1110 65 0 
MEDIA sel 0 vt Vs 0 C.EQ 
010010 0 00000 000001 
6 4 1 5 5 5 6 
Format: 
C.EQ.OB vs, vt 
Description: 


The values in vector vt are compared to the values in vector vs and the result is 
written to the condition codes. All 8 CC bits are written with comparison results. 
The comparison made is equal (EQ). The inverse comparison (NE) is not 
necessary; the instructions that use condition codes (BCIF, BCIT, MOVF, 
MOVT, PICKF, PICKT) allow both CC = 0 and CC = | tests. The sel field selects 
the values of v¢[] used for each i. 


No data-dependent exceptions are possible. The operands must be values in the 
OB format. If they are not, the results are undefined and the values of the operand 
vectors become undefined. The result of this instruction is undefined if the 
processor is executing in 16 FP register mode. 
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C.EQ.0B 


Operation: 


Vector Compare (Equal) 
(continued) 


C.EQ.0B 


ts <— FPR[vs] 
tt — select(sel, vt) 


SetFPConditionCode(7, (tSg3. 56 = tte. 56)) 
SetFPConditionCode(6, (ts55,.48 = tts55..48)) 
SetFPConditionCode(5, (ts47..49 = tt47..49)) 
SetFPConditionCode(4, (ts39, 32 = ttgg,.32)) 
SetFPConditionCode(3, (ts31.24 = tts1..24)) 
SetFPConditionCode(2, (tse3. 16 = ttes. 16)) 
SetFPConditionCode(1, (ts15. g = ttys. g)) 
SetFPConditionCode(0, (ts7..9 = tt7..9)) 


Exceptions: 


Coprocessor Unusable exception 
Reserved Instruction exception 
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C.LE.OB eee C.LE.OB 


(Less Than or Equal) 


31 26 25 22 21 20 1615 1110 65 0 
MEDIA sel 0 vt VS 0 C.LE 
010010 0 00000 000101 
6 4 1 5 5 5 6 
Format: 
C.LE.OB vs, vt 
Description: 


The values in vector vt are compared to the values in vector vs and the result is 
written to the condition codes. All 8 CC bits are written with comparison results. 
The comparison made is less than or equal (LE). The inverse comparison (GT) is 
not necessary; the instructions that use condition codes (BCIF, BCIT, MOVF, 
MOVT, PICKF, PICKT) allow both CC = 0 and CC = | tests. The sel field selects 
the values of v¢[] used for each i. 


No data-dependent exceptions are possible. The operands must be values in the 
OB format. If they are not, the results are undefined and the values of the operand 
vectors become undefined. The result of this instruction is undefined if the 
processor is executing in 16 FP register mode. 
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C.LE.OB Freee ohe C.LE.OB 


(continued) 


Operation: 


ts <— FPR[vs] 
tt — select(sel, vt) 


tSo3..16 < tts 16 
ts45..8 < tts. 8) 
ts7_9 < tt7..0)) 


SetFPConditionCode(7, (tS¢3. 56 < ttg3..56)) 
SetFPConditionCode(6, (ts55._ 48 < ttss..48)) 
SetFPConditionCode(5, (tS47..40 < tt47..40)) 
SetFPConditionCode(4, (ts39_ 30 < ttgg9..32)) 
SetFPConditionCode(3, (ts31_ 24 < ttsy.24)) 
SetFPConditionCode(2, ( )) 

(1, ( 

(0, ( 


Exceptions: 


Coprocessor Unusable exception 
Reserved Instruction exception 
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C.LT.OB eae C.LT.OB 


31 26 25 22 21 20 1615 1110 65 0 
MEDIA sel 0 vt VS 0 C.LT 
010010 0 00000 000100 
6 4 1 5 5 5 6 
Format: 
C.LT.OB vs, vt 
Description: 


The values in vector vt are compared to the values in vector vs and the result is 
written to the condition codes. All 8 CC bits are written with comparison results. 
The comparison made is less than(LT). The inverse comparison (GE) is not 
necessary; the instructions that use condition codes (BCIF, BCIT, MOVF, 
MOVT, PICKF, PICKT) allow both CC = 0 and CC = | tests. The sel field selects 
the values of v¢[] used for each i. 


No data-dependent exceptions are possible. The operands must be values in the 
OB format. If they are not, the results are undefined and the values of the operand 
vectors become undefined. The result of this instruction is undefined if the 
processor is executing in 16 FP register mode. 
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C.LT.OB ea C.LT.OB 


(continued) 


Operation: 
ts < FPR[vs] 
tt — select(sel, vt) 
SetFPConditionCode(7, (ts63. 56 < ttg3..56)) 
SetFPConditionCode(6, (ts55..48 < tts5..48)) 
SetFPConditionCode(5, (tS47..49 < tta7..40)) 
SetFPConditionCode(4, (ts39_ 32 < ttgg..32)) 
SetFPConditionCode(3, (ts31_24 < tts1.24)) 
SetFPConditionCode(2, (tse3. 16 < tte, 16) 
SetFPConditionCode(1, (ts45_g < tty5. 8) 
SetFPConditionCode(0, (ts7..9 < tt7..9)) 


Exceptions: 


Coprocessor Unusable exception 
Reserved Instruction exception 
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Multimedia Instruction Set 


MAX.OB 


31 26 25 22 2120 1615 1110 65 0 
MEDIA sel 0 vt vs vd MAX 
010010 0 000111 
6 4 1 5 5 5 6 
Format: 


MAX.OB vd, vs, vt 


Description: 


The values in vector vt are compared to the values in vector vs and the larger is 
written to each element of vector vd. The sel field selects the values of vt[] used 


for each i. 


No data-dependent exceptions are possible. The operands must be values in the 
OB format. If they are not, the results are undefined and the values of the operand 


vectors become undefined. The result of this instruction is undefined if the 


processor is executing in 16 FP register mode. 
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MAX.OB Vector Maximum 


(continued) 


Operation: 


MAX.OB 


ts — FPRIvs] 
tt — select(sel, vt) 
FPR[vd] <- MaxOB(ts¢3..56, ttes..56) 
|| MaxOB(tss5, 4g, ttss..48) 
|| MaxOB(ts47..40, tt47..40) 
|| MaxOB(ts39_ 39, ttg9..32) 
|| MaxOB(ts31.24, ttsy..24) 
|| MaxOB(tso3 16, ttes 16) 
|| MaxOB(ts15.g, tts..8) 
|| MaxOB(ts7. 0, tt..0) 
function MaxOB(ts, tt) 
if (0 || ts) > (0 || tt) then 
MaxOB < ts 
else 
MaxOB < tt 
endif 
end MaxOB 
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Exceptions: 


Coprocessor Unusable exception 
Reserved Instruction exception 
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MIN.OB 


31 26 25 22 2120 1615 1110 65 0 
MEDIA sel 0 vt vs vd MIN 
010010 0 000110 
6 4 1 5 5 5 6 
Format: 


MIN.OB vd, vs, vt 


Description: 


The values in vector vt are compared to the values in vector vs and the smaller is 
written to each element of vector vd. The sel field selects the values of vt[] used 


for each i. 


No data-dependent exceptions are possible. The operands must be values in the 

OB format. If they are not, the results are undefined and the values of the operand 
vectors become undefined. The result of this instruction is undefined if the 
processor is executing in 16 FP register mode. 
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MIN.OB Vector Minimum MIN.OB 


(continued) 


Operation: 


ts <— FPR[vs] 
tt — select(sel, vt) 
FPR[vd] < MinOB(ts¢s. 56, ttes..s6) 
|| MinOB(tss5..4g, tts5..48) 
|| MinOB(ts47, 40, tt47..40) 
|| MinOB(ts3g_ 32, tt39..32) 
|| MinOB(ts34_.24, tts1_.24) 
|| MinOB(ts23, 46; ttes..16) 
|| MinOB(ts15_ 8, ttys. 8) 
|| MinOB(ts7. o, ttz..o) 
function MinOB(ts, tt) 
if (O || ts) < (0 || tt) then 
MinOB < ts 
else 
MinOB < tt 
endif 
end MinOB 


Exceptions: 


Coprocessor Unusable exception 
Reserved Instruction exception 
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MUL.OB Vector Multiply MUL.OB 


31 26 25 22 2120 1615 1110 65 0 
MEDIA sel 0 vt Vs vd MUL 
010010 0 110000 
6 4 1 5 5 5 6 
Format: 


MUL.OB vd, vs, vt 


Description: 


The values in vector vt are multiplied by the values in vector vs and the product is 
written into vector vd. Saturated arithmetic is performed: overflows and 
underflows clamp to the largest or smallest representable value before writing to 
vector vd. The sel field selects the values of vt[] used for each i. 


No data-dependent exceptions are possible. The operands must be values in the 
OB format. If they are not, the results are undefined and the values of the operand 
vectors become undefined. The result of this instruction is undefined if the 
processor is executing in 16 FP register mode. 
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M U L . O B Vector Multiply 


(continued) 


Operation: 


MUL.OB 


ts <— FPR[vs] 

tt — select(sel, vt) 

FPR[vd] <- MulOB(ts¢3. 56; ttes..56) 
|| MulOB(tss5. 4g; ttss, 48) 
|| MulOB(ts47, 40, tta7..40) 
|| MulOB(tsgg, 32, ttgg, 32) 
|| MulOB(ts31_24, tts1, 24) 

|| MulOB(ts23, 46, tte3, 16) 
|| MulOB(ts15, 9, tt15. 8) 
|| MulOB(ts7. 0, tt7..o) 

function MulOB(ts, tt) 

t — (08 || ts) x (0° || tt) 
if tis g# 08 then 
MulOB < 18 
else 
MulOB < t7. 9 
endif 
end MulOB 
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Exceptions: 


Coprocessor Unusable exception 
Reserved Instruction exception 
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MULA.OB 


MULA.OB vs, vt 


Description: 


31 26 25 22 2120 1615 1110 65 0 
MEDIA sel 0 vt Vs A MULA 
010010 0) 00000 110011 
6 4 1 5 5 5 6 

Format: 


The values in vector vt are multiplied by the values in vector vs and the product is 


added to the Accumulator. Wrapped arithmetic is performed: overflows and 
underflows wrap around the Accumulator’s representable range before being 


written into the Accumulator. The Accumulator is in the OB format. The se/ field 


selects the values of v¢[] used for each i. 


No data-dependent exceptions are possible. The operands must be values in the 

OB format. If they are not, the results are undefined and the values of the operand 
vectors become undefined. The result of this instruction is undefined if the 
processor is executing in 16 FP register mode. 
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MU LA.OB sain ee ae MU LA.OB 


(continued) 


Operation: 


ts <— FPR[vs] 

tt — select(sel, vt) 

ACC < AccMulOB(ACC 91.168, tS63..56, ttes..56) 
|| AccMulOB(ACC 467, 144, tS55..48, ttss..48) 
|| AccMulOB(ACC 143. .120; t$47..40, tt47..40) 
|| AccMulOB(ACC 1149. 96, tSag..3 ; ttsg..32) 
|| AccMulOB(ACCo¢, 72, tS31..24, tts1..24) 
|| AccMulOB(ACC7,_ 4g, tS23..16, tte3. 16) 
|| AccMulOB(ACCa7,_ 94, tS15..8, tt15..8) 
|| AccMulOB(ACCo3 6, tS7..0, tt7..0) 

function AccMulOB(a, ts, tt) 

AccMulOB < a + (0! || ts) x (018 || tt) 
end AccMulOB 
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Exceptions: 


Coprocessor Unusable exception 
Reserved Instruction exception 
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MULL.OB Vector Multiply and MULL.OB 


Load Accumulator 


31 26 25 22 2120 1615 1110 65 0 
MEDIA sel 0 vt Vs L MULL 
010010 0 10000 110011 
6 4 1 5 5 5 6 

Format: 


MULL.OB vs, vt 


Description: 


The values in vector vt are multiplied by the values in vector vs and the product is 
stored in the Accumulator. Wrapped arithmetic is performed, such that overflows 
and underflows wrap around the Accumulator’s representable range before being 
written into the Accumulator. The Accumulator result is in the OB format. The sel 
field selects the values of v¢[] used for each i. 


No data-dependent exceptions are possible. The operands must be values in the 
OB format. If they are not, the results are undefined and the values of the operand 
vectors become undefined. The result of this instruction is undefined if the 
processor is executing in 16 FP register mode. 
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MULL.OB ae eenina ee MULL.OB 


(continued) 


Operation: 


ts <— FPR[vs] 

tt — select(sel, vt) 

ACC < AccMulOB(0"%, ts¢3. 56, ttes..56 
|| AccMulOB(0™, tsss. 48; ttss..48 
|| AccMulOB(0™, ts47. 40, tt47..40 

|| AccMulOB(0™, tsgq, 32, ttgg..32 

|| AccMulOB(0, ts34. 94, tts1. 94 

|| AccMulOB(0™, ts23. 16; ttes..16 

|| AccMulOB(0%, ts45, g, tts.) 
|| AccMulOB(0*%, ts7_ 0, tt7..o) 
function AccMulOB(a, ts, tt) 

AccMulOB < a + (0! || ts) x (01 || tt) 
end AccMulOB 


weweaw~eweaewow we 
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Exceptions: 


Coprocessor Unusable exception 
Reserved Instruction exception 
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MULS.OB 


MULS.OB vs, vt 


Description: 


31 26 25 22 21 20 1615 1110 65 0 
MEDIA sel 0 vt VS S MULS 
010010 0 00000 110010 
6 4 1 5 5 5 6 

Format: 


The values in vector vt are multiplied by the values in vector vs and the product is 
subtracted from the Accumulator. Wrapped arithmetic is performed: overflows 

and underflows wrap around the Accumulator’s representable range before being 
written into the Accumulator. The Accumulator is in the OB format. The sel field 


selects the values of v¢[] used for each i. 


No data-dependent exceptions are possible. The operands must be values in the 

OB format. If they are not, the results are undefined and the values of the operand 
vectors become undefined. The result of this instruction is undefined if the 
processor is executing in 16 FP register mode. 
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Vector Multiply and 
MU LS.OB Subtract eamieis MU LS.OB 


(continued) 


Operation: 


ts <— FPR[vs] 
tt — select(sel, vt) 
ACC © SubMulOB(ACC 491.168, tS63..56; ttes..5 ) 
|| SubMulOB(ACC 467, 144, tS55..48, ttss..48) 
|| SubMulOB(ACC 443.120; t$47..40, tta7..40) 
|| SubMulOB(ACC 419. 96, tSgg..32, ttgg..32) 
|| SubMulOB(ACCg5,_ 72, tS31..24, ttg1..24) 
|| SubMulOB(ACC7,_ 4g, tS23..16, tte3..16) 
|| SubMulOB(ACCy7, 94, tS15..8, tt15..8) 
|| SubMulOB(ACCo3. 9, t87.o, tt.) 
function SubMulOB(a, ts, tt) 
SubMulOB < a - (0'° || ts) x (07S || tt) 
end SubMulOB 
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Exceptions: 


Coprocessor Unusable exception 
Reserved Instruction exception 
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MULSL.OB 


Vector Multiply 
Subtract and Load 


Multimedia Instruction Set 


MULSL.OB 


MULSL.OB vs, vt 


Description: 


31 26 25 22 2120 1615 1110 65 0 
MEDIA sel 0 vt Vs L MULSL 
010010 0 10000 110010 
6 4 1 5 5 5 6 

Format: 


The values in vector vf are multiplied by the values in vector vs and negated. The 
vector result is stored to the Accumulator. The Accumulator result is in the OB 


format. The sel field selects the values of ve[] used for each i. 


No data-dependent exceptions are possible. The operands must be values in the 

OB format. If they are not, the results are undefined and the values of the operand 
vectors become undefined. The result of this instruction is undefined if the 
processor is executing in 16 FP register mode. 
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MULSL.OB ss guttectandtoa «€©60|C$ MULSL.OB 


(continued) 


Operation: 


ts <— FPR[vs] 

tt — select(sel, vt) 

ACC < SubMulOB(0"", tsgs. 56, ttes..56 
|| SubMulOB(0**, ts55,_49, ttss..48 
|| SubMulOB(0**, ts47_49, tta7..40 
|| SubMulOB 


( 

( 

( 

(0*4, tsg9, 32. tts9. 32 
|| SubMulOB(0*, ts34_ 04, tts1_04 

( 

( 

( 


weweaw~wweaewowo we 


|| SubMUlOB(0*, tso3. 46; tto3..16 

|| SubMulOB(0%, ts15..g, ttrs. 8) 

|| SubMulOB(0*, ts7_ 9, tz.) 
function SubMulOB(a, ts, tt) 

SubMulOB < a - (0'8 || ts) x (0'8 |j tt) 
end SubMulOB 
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Exceptions: 


Coprocessor Unusable exception 
Reserved Instruction exception 
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Multimedia Instruction Set 


NOR.OB Vector NOR NOR.OB 
31 26 25 22 21 20 1615 11 10 65 0 
MEDIA sel 0 vt vs vd NOR 
010010 0 001111 
6 4 1 5 5 5 6 

Format: 


NOR.OB vd, vs, vt 


Description: 


Each element of vector vs is combined with the corresponding element of vector 
vt in a bitwise logical NOR operation. The sel field selects the values of v¢[] used 


for each i. 


No data-dependent exceptions are possible. The operands must be values in the 

OB format. If they are not, the results are undefined and the values of the operand 
vectors become undefined. The result of this instruction is undefined if the 
processor is executing in 16 FP register mode. 
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NOR.OB Vector NOR 


(continued) 


Operation: 


NOR.OB 


ts <— FPR[vs] 
tt — select(sel, vt) 
FPR[vd] < NorOB(ts¢3. 56; ttes..56) 
|| NorOB(tssg_.48, ttss..48) 
|| NorOB(ts47._.40, tt47..40) 
|| NorOB(tsgg, 32, ttg9. 32) 
|| NorOB(ts3y,.24, tts1..24) 
|| NorOB(ts23, 16, tta3,.16) 
|| NorOB(ts45, g, tty5..s) 
|| NorOB(ts7..0, tt7..0) 
function NorOB(ts, tt) 
NorOB < (0 || ts) nor (0 || tt) 
end NorOB 


Exceptions: 


Coprocessor Unusable exception 
Reserved Instruction exception 
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Multimedia Instruction Set 


OR.OB Vector OR OR.OB 
31 26 25 22 2120 1615 1110 65 0 
MEDIA sel 0 vt VS vd OR 
010010 0 001110 
6 4 1 5 5 5 6 
Format: 


OR.OB vd, vs, vt 


Description: 


Each element of vector vs is combined with the corresponding element of vector 
vt in a bitwise logical OR operation. The se/ field selects the values of v¢[] used 


for each i. 


No data-dependent exceptions are possible. The operands must be values in the 

OB format. If they are not, the results are undefined and the values of the operand 
vectors become undefined. The result of this instruction is undefined if the 
processor is executing in 16 FP register mode. 
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OR.OB Vector OR 


(continued) 


Operation: 


OR.OB 


ts <— FPR[vs] 

tt — select(sel, vt) 

FPR[vd] < OrOB(ts¢s. 56; ttes..56) 
|| OrOB(tss5..4 , ttss. 48) 
|| OrOB(ts47..4 ; tt47..40) 

|| OrOB(tsg9_.3 , ttgg..32) 

|| OrOB(ts31,2 , tts1_.24) 

|| OrOB(ts23, 4, ttes. 16) 

|| OrOB(ts15,_g, ttys..g) 

|| OrOB(ts7. 0, tt7..o) 

function OrOB(ts, tt) 

OrOB < (0 || ts) or (0 || tt) 
end OrOB 


Exceptions: 


Coprocessor Unusable exception 
Reserved Instruction exception 
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PICKF.OB 


Vector Pick False 


Multimedia Instruction Set 


PICKF.OB 


PICKF.OB vd, vs, vt 


Description: 


31 26 25 22 2120 1615 1110 65 0 
MEDIA sel 0 vt Vs vd PICKF 
010010 0 000010 
6 4 1 5 5 5 6 

Format: 


The vector vd is written with either the corresponding element of vector vs or the 
corresponding element of vector vt, depending on the state of the CC bits. All 8 
CC bits are used. The sel field selects the values of vt[] used for each i. 


Both PICKF and PICKT are necessary since the operands are not symmetrical; 
every element of vector vs is used, whereas the se/ field selects values of ve[] used 


for each i. 


No data-dependent exceptions are possible. The operands must be a value in OB 
format. If they are not, the results are undefined and the values of the operand 

vectors become undefined. The result of this instruction is undefined if the 
processor is executing in 16 FP register mode. 
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PICKF.OB = rrexrse PICK F.OB 


(continued) 


Operation: 


ts <— FPR[vs] 
tt — select(sel, vt) 
FPR[vd] <— PickOB(FPConditionCode(7) = 0, tsg3. 56, ttgs..56) 
|| PickOB(FPConditionCode(6) = 0, ts55, 4g, tts55..48) 
|| PickOB(FPConditionCode(5) = 0, ts47_.49, tt47..49) 
|| PickOB(FPConditionCode(4 =0, tS39. 32, ttg9..32) 
|| PickOB(FPConditionCode(3 ) 
|| PickOB(FPConditionCode(2) = 0, tso3._ 16, ttes. 16) 
|| PickOB(FPConditionCode(1) = 0, ts15, g, ttys. 8) 
|| PickOB(FPConditionCode(0) = 0, ts7. 9, tt7..9) 
function PickOB(c, ts, tt) 
if c then 
PickOB < ts 
else 
PickOB < tt 


) 
) 
) 
) = 0, ts31_ 24, tts1.24 
) 
) 
) 


endif 


Exceptions: 


Coprocessor Unusable exception 
Reserved Instruction exception 
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PICKT.OB 


Vector Pick True 


Multimedia Instruction Set 


PICKT.OB 


PICKT.OB vd, vs, vt 


Description: 


31 26 25 22 21 20 1615 1110 65 0 
MEDIA sel 0 vt Vs vd PICKT 
010010 0 000011 
6 4 1 5 5 5 6 

Format: 


The vector vd is written with either the corresponding element of vector vs or the 
corresponding element of vector vt, depending on the state of the CC bits. All 8 
CC bits are used. The sel field selects the values of vt[] used for each i. 


Both PICKF and PICKT are necessary since the operands are not symmetrical; 
every element of vector vs is used, whereas the se/ field selects values of ve[] used 


for each i. 


No data-dependent exceptions are possible. The operands must be a value in OB 
format. If they are not, the results are undefined and the values of the operand 

vectors become undefined. The result of this instruction is undefined if the 
processor is executing in 16 FP register mode. 
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PICKT.OB == *eerPekt™me = BICKT.OB 


(continued) 


Operation: 

ts <— FPR[vs] 

tt — select(sel, vt) 

FPR[vd] < PickOB(FPConditionCode(7) = 1, tsg3. 56, ttg3. 56) 
|| PickOB(FPConditionCode(6) = 1, ts55__48, tts55..48) 
|| PickOB(FPConditionCode(5) = 1, ts47_.49, tt47..40) 
|| PickOB(FPConditionCode(4) = 1, ts39_ 39, tts. 32) 
|| PickOB(FPConditionCode(3) = 1, ts31_ 24, tts1_24) 
|| PickOB(FPConditionCode(2) = 1, tso3._ 16, ttes. 16) 
|| PickOB(FPConditionCode(1) = 1, ts15, g, tts. 8) 
|| PickOB(FPConditionCode(0) = 1, ts7. 9, tt7..9) 

function PickOB(c, ts, tt) 
if c then 

PickOB < ts 
else 
PickOB < tt 
endif 
Exceptions: 


Coprocessor Unusable exception 
Reserved Instruction exception 
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Multimedia Instruction Set 


RACH.OB ea RACH.OB 


Accumulator High 


31 26 25 22 21 20 1615 1110 65 0 
MEDIA H 0 0 0 vd RACH 
010010 1000 |0} 00000 | 00000 111111 
6 4 1 5 5 5 6 

Format: 
RACH.OB vd 
Description: 


Read the most-significant third of the bits of the Accumulator elements. No 
clamping of the values extracted is performed; the bits are simply copied into 
elements of vd[]. 


RACL/RACM/RACH followed by WACL/WACH are used to save and restore 
the Accumulator. 


No data-dependent exceptions are possible. The result of this instruction is 
undefined if the processor is executing in 16 FP register mode. 


Operation: 


FPR[vd] <— ACC 494 184 
|| ACC167. 160 
|| ACC143. 136 
|| ACC149.412 
|| ACCg5. 88 
|| ACC71.64 
|| ACC47. 40 
|| ACC23..16 


Exceptions: 


Coprocessor Unusable exception 
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RACL.OB IRR RACL.OB 


Accumulator Low 


31 26 25 22 21 20 1615 11 10 65 0 
MEDIA L 0 0 0 vd RACL 
010010 0000 /0} 00000 | 00000 111111 
6 4 1 5 5 5 6 

Format: 
RACL.OB vd 
Description: 


Read the least-significant third of the bits of the Accumulator elements. No 
clamping of the values extracted is performed; the bits are simply copied into 
elements of vd{[]. 


RACL/RACM/RACH followed by WACL/WACH are used to save and restore 
the Accumulator. 


No data-dependent exceptions are possible. The result of this instruction is 
undefined if the processor is executing in 16 FP register mode. 


Operation: 


FPR[vd] < ACC 175, 168 
|| ACC 454.144 
|| ACC 1497, 420 
|| ACC 103.96 
|| ACC79,.72 
|| ACCs5,48 
|| ACC31.24 
|| ACC7. 9 


Exceptions: 


Coprocessor Unusable exception 


718 Vr5432 Microprocessor User’s Manual 


Multimedia Instruction Set 


RACM.OB ener RACM.OB 


Accumulator Middle 


31 26 25 22 21 20 1615 11 10 65 0 
MEDIA M 0 0 0 vd RACM 
010010 0100 |/0} 00000 | 00000 111111 
6 4 1 5 5 5 6 

Format: 
RACM.OB vd 
Description: 


Read the middle third of the bits of the Accumulator elements. No clamping of the 
values extracted is performed; the bits are simply copied into elements of vd[]. 


RACL/RACM/RACH followed by WACL/WACH are used to save and restore 
the Accumulator. 


No data-dependent exceptions are possible. The result of this instruction is 
undefined if the processor is executing in 16 FP register mode. 


Operation: 


FPR[vd] < ACCi83. 176 

|| ACC159. 152 

|| ACC 135, 128 

|| ACC1414..104 

|| ACC 87. 80 

|| ACC¢3. 56 

|| ACC 39. 32 

|| ACC15.8 


Exceptions: 


Coprocessor Unusable exception 
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RZ U O B Vector Scale, Round, 


and Clamp Accumulator 


RZU.OB 


31 26 25 22 2120 1615 1110 65 0 
MEDIA sel 0 vt 0 vd RZU 
010010 0 00000 100000 
6 4 1 5 5 5 6 
Format: 
RZU.OB vd, vt 
Description: 


720 


The values in the Accumulator are logically shifted right by the values in vector 
vt, rounded to the nearest value with exactly halfway results rounded toward zero, 
and clamped to an unsigned subset of the range of va[]. The Accumulator is in the 
OB format. The sel field selects the values of vz[] used for each i. The shift amount 
must be an immediate and the value must be 0, 8, or 16. The clamping range is 


0..255. 


No data-dependent exceptions are possible. The operands must be values in the 
OB format. If they are not, the results are undefined and the values of the operand 
vectors become undefined. The result of this instruction is undefined if the 


processor is executing in 16 FP register mode. 


Vr5432 Microprocessor User’s Manual 


Multimedia Instruction Set 


Vector Scale, Round, 
RZU.OB and Clamp Accumulator RZU.OB 


(continued) 


Operation: 


tt — select(sel, vt) 
FPR[vd] <- RZUOB(ACC 493. 168, ttes..56) 
|| RZUOB(ACC 467. 144, tts5.48) 
|| RZUOB(ACC 443, 120, tt47..40) 
|| RZUOB(ACC 419. 96; ttsg..32) 
|| RZUOB(ACCgg, 72, tts1..24) 
|| RZUOB(ACC7_ 4g, tte3..16) 
|| RZUOB(ACCa7, 94, ttys5, 8) 
|| RZUOB(ACC23 9, tt7_.0) 
function RZUOB(a, s) 
if O || s > 23 then 
RZUOB «+ 0° 
else 
t— 0° || a23.5 
if O || t<0'” || 18 then 
RZUOB < t7.6 
else 
RZUOB + 18 
endif 
endif 
end RZUOB 


Exceptions: 


Coprocessor Unusable exception 
Reserved Instruction exception 
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SHFL.op.OB aces SHFL.op.OB 


31 26 25 22 21 20 1615 1110 65 0 
MEDIA sel 0 vt Vs vd SHFL 
010010 0) 011111 
6 4 1 5 5 5 6 

Format: 
SHFL.op.OB vd, vs, vt 
Description: 


Elements of vectors vs and vt are merged into a new vector. Not all combinations 
of values are available; the operations of the variants of this instruction are tailored 
to the data movement patterns of specific calculations. The shuffles available are 
given in Table 19-3. 


The sel field selects the values of vt[] used for each i. The se/ field must specify a 
vector, not an immediate or a scalar. The remaining bits in the field are not used 
for a vt[] select, but rather are used to encode the shuffle operation. 


Table 19-3 Operation Encoding for Shuffles 
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sel Operation | vd[7] | va[6] | va[5] | va[4] | va[3] | va[2] | vali] | va[o] 
0100 PACH | vs[7] | vs[5] | vs[3] | vs{1] | vet(7] | vtf5) | vet[3] | vt{l] 
0101 PACL | vs{6] | vs[4] | vs{2] | vsfo] | vtf6y | vef4] | vef2] | vefo] 
0110 MIXH | vs[7] | vti7] | vs{6] | vtf6) | vs[5) | vtls) | vs[4] | vt[4] 
O11 MIXL | vs[3] | ve(3] | vs{2] | ve(2] | vs(1] | veft] | vs(oy | vefoy 


No data-dependent exceptions are possible. The operands must be values in the 
OB format. If they are not, the results are undefined and the values of the operand 
vectors become undefined. The result of this instruction is undefined if the 
processor is executing in 16 FP register mode. 
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Vector 
Shuffle 
(continued) 


SHFL.op.OB 


Operation: 


Multimedia Instruction Set 


SHFL.op.OB 


PACH.OB 
ts <— FPR[vs] 
tt — select(sel, vt) 
FPR[vd] < ts¢g_5¢ || tS47..40 
|| tS31..24 || ts15..8 
|| ttes..56 || tta7.4 


|| tt31_24 || ttis.8 
PACL.OB 


ts <— FPR[vs] 

tt — select(sel, vt) 

FPR[vd] < ts55..48 || tS39..32 
|| tS23..16 || 87.0 
|| tts5..48 || ttsg..3 


|| tto3. 16 || tt7.0 
MIXH.OB 


ts <— FPR[vs] 

tt — select(sel, vt) 

FPR[vd] < ts¢g..56 || ttes..56 
|| ts55..48 || ttss..48 
|| t47..40 || tta7..40 
|| tSgg..32 || ttgg.32 
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SHFL.op.OB Shiie 


(continued) 


Operation (continued): 


SHFL.op.OB 


MIXL.OB 
ts <— FPR[vs] 
tt — select(sel, vt) 
FPR[vd] < ts31_.24 || ttg1.24 
|| tSo3..16 || ttes..16 
I| ts15..9 || ttis..¢ 
|| tS7_.9 || tt7..0 
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Exceptions: 


Coprocessor Unusable exception 
Reserved Instruction exception 


Vr5432 Microprocessor User’s Manual 


SLL.OB 


Vector Shift Left Logical 


Multimedia Instruction Set 


SLL.OB 


31 26 25 22 2120 1615 1110 65 0 
MEDIA sel 0 vt Vs vd SLL 
010010 0 010000 
6 4 1 5 5 5 6 
Format: 


SLL.OB vd, vs, vt 


Description: 


Each element of vector vs is shifted left by an amount specified by an immediate 
or an element of vector vt, and zeros are shifted into the low-order bits. The results 
are written into vector vd. All but the lower 3 bits of the shift amount are masked 
to 0, so the largest possible shift is 7 places. The se/ field selects the values of ve[] 


used for each i, which must be a scalar or an immediate. 


No data-dependent exceptions are possible. The operands must be values in the 

OB format. If they are not, the results are undefined and the values of the operand 
vectors become undefined. The result of this instruction is undefined if the 
processor is executing in 16 FP register mode. 
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SLL.OB Vector Shift Left Logical SLL.OB 


(continued) 


Operation: 


ts — FPR(vs] 

tt — select(sel, vt) 

FPR[vd] <- SLLOB(tsg3..56, ttes..56) 
|| SLLOB(tss5,_48, ttss. 48) 
|| SLLOB(ts47..40, tta7,.40) 
|| SLLOB(ts39_ 39, ttgg. 32) 
|| SLLOB(tsg1 04, ttg1..24) 

|| SLLOB(ts23. 16, tte3. 16) 
|| SLLOB(ts15. 8; tt1s. 8) 
|| SLLOB(ts7, 0, tt7..0) 

function SLLOB(ts, tt) 
s<tto 0 
SLLOB < ts7-5.9 || 0° 

end SLLOB 


Exceptions: 


Coprocessor Unusable exception 
Reserved Instruction exception 
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SRL.OB 


Vector Shift Right Logical 


Multimedia Instruction Set 


SRL.OB 


31 26 25 22 2120 1615 1110 65 0 
MEDIA sel 0 vt VS vd SRL 
010010 0 010010 
6 4 1 5 5 5 6 
Format: 


SRL.OB vd, vs, vt 


Description: 


Each element of vector vs is shifted right by an amount specified by an immediate 


or an element of vector vt, and zeros are shifted into the low-order bits. The results 
are written into vector vd. All but the lower 3 bits of the shift amount are masked 
to 0, so the largest possible shift is 7 places. The se/ field selects the values of ve[] 


used for each i, which must be a scalar or an immediate. 


No data-dependent exceptions are possible. The operands must be values in the 

OB format. If they are not, the results are undefined and the values of the operand 
vectors become undefined. The result of this instruction is undefined if the 
processor is executing in 16 FP register mode. 
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SR L.OB Vector Shift Right Logical SR L.OB 


(continued) 


Operation: 


ts — FPR([vs] 

tt — select(sel, vt) 

FPR[vd] <- SRLOB(ts¢3. 56; ttes..56) 
|| SRLOB(tss5. ag, ttss,.48) 
|| SRLOB(ts47. 40, tt47..40) 
|| SRLOB(tsgg_ 39, tt39..32) 
|| SRLOB(ts31, 24, ttg1_24) 

|| SRLOB(tsg3, 146, ttes..16) 
|| SRLOB(ts15,_g; tty5..8) 
|| SRLOB(ts7_o, tt .o) 

function SRLOB(ts, tt) 
s<tto 0 
SRLOB < 08 || ts7.5 
end SRLOB 


Exceptions: 


Coprocessor Unusable exception 
Reserved Instruction exception 
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Multimedia Instruction Set 


SU B.OB Vector Subtract SU B.OB 


31 26 25 22 21 20 1615 1110 65 0 
MEDIA sel 0 vt VS vd SUB 
010010 0 001010 
6 4 1 5 5 5 6 
Format: 


SUB.OB vd, vs, vt 


Description: 


The difference of the values in vector vt and vector vs is written into vector vd. 
Saturated arithmetic is performed: overflows and underflows clamp to the largest 
or smallest representable value before writing to vector vd. The sel field selects 
the values of v¢[] used for each i. 


No data-dependent exceptions are possible. The operands must be values in the 
specified format. If they are not, the results are undefined and the values of the 
operand vectors become undefined. The result of this instruction is undefined if 
the processor is executing in 16 FP register mode. 
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SU B.OB Vector Subtract SU B.OB 


(continued) 


Operation: 


ts <— FPR[vs] 
tt — select(fmtsel, vt) 
FPR[vd] < SubOB(ts¢3. 56, ttes..56) 
|| SubOB(tss5,_ 4g, tts55..48) 
|| SuBOB(ts47..40, tt47..40) 
|| SUBOB(ts39_ 32, ttgg..32) 
|| SUDOB(ts31 94, ttg1..24) 
|| SubOB(ts23. 16, ttas.16) 
|| SubOB(ts 15.8, tts. 8) 
|| SubOB(ts7. 0, ttz..0) 
function SubOB(ts, tt) 
t <— (0 || ts) - (0 || tt) 
if tg = 1 then 
SubOB < 08 
else 
SubOB <— t7..0 
endif 
end SubOB 


Exceptions: 


Coprocessor Unusable exception 
Reserved Instruction exception 
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Multimedia Instruction Set 


WACH.OB aan WACH.OB 


Accumulator High 


31 26 25 22 21 20 1615 11 10 65 0 
MEDIA H 0 0 vs 0 WACH 
010010 1000 |0} 00000 00000 111110 
6 4 1 5 5 5 6 

Format: 
WACH.OB vs 
Description: 


This instruction writes the most-significant third of the bits of the Accumulator 
elements. The least-significant two-thirds of the bits of the Accumulator elements 
are unaffected. 


RACL/RACM/RACH followed by WACL/WACH are used to save and restore 
the Accumulator. 


This instruction is the only instruction that writes a portion of the Accumulator. 
WACL writes all bits in the accumulator, so it must precede WACH when 
restoring the Accumulator. 


No data-dependent exceptions are possible. The result of this instruction is 
undefined if the processor is executing in 16 FP register mode. 


Operation: 


ACC < FPRIvs]63..56 || ACC 183.168 


|| FPR[vs]55,.48 || AC 459.144 


|| FPR[vs]47..40 || AC 135.120 
|| FPR[vs]39,.32 || AC 441.96 


|| FPR[vs]31..24 || AC 87.72 
|| FPR[vs]o3..16 || AC 63..48 
|| FPR[vs]15..8 || AC 39.24 
|| FPR[vs]7..9 || ACC 15.0 
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WAC H E O B Vector Write 


Accumulator High 
(continued) 


Exceptions: 


Coprocessor Unusable exception 
Reserved Instruction exception 
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Multimedia Instruction Set 


WACL.OB oe WACL.OB 


Accumulator Low 


31 26 25 22 21 20 1615 11 10 65 0 
MEDIA L 0 vt Vs 0 WACL 
010010 0000 |0 00000 111110 
6 4 1 5 5 5 6 


Format: 
WACL.OB vs, vt 


Description: 


This instruction writes the least-significant two-thirds of the bits of the 
Accumulator elements. The upper one-third of the bits of the Accumulator 
elements are written by the sign bits of the corresponding elements of vector vs[] 
and replicated by 8, depending on the format. 


RACL/RACM/RACH followed by WACL/WACH are used to save and restore 
the Accumulator. 


No data-dependent exceptions are possible. The result of this instruction is 
undefined if the processor is executing in 16 FP register mode. 


Operation: 


ACC < 0° || FPRIvsle..56 Il FPRIvtles..56 

|| 0° || FPRIvslss. 48 || FPR[vtlss..48 
|| 0° || FPR[vs]47_ 40 || FPR[vt]47..40 
|| 0° || FPR[vs]s9. 32 || FPRIVtlgg. 32 
|| 0° || FPRIvs]s1.24 || FPR[vt}31_24 
|| 0° || FPRIvslo3. 16 Il FPRIvtlos. 16 
|| 0° || FPRIvshis..s || FPRIVtlis.s 

|| 0° || FPRIvsl7..o |] FPRIvtl..0 


Exceptions: 


Coprocessor Unusable exception 
Reserved Instruction exception 
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XOR.OB Vector XOR XOR.OB 
31 26 25 22 21 20 1615 1110 65 
MEDIA sel 0 vt vs vd XOR 
010010 0 001101 
6 4 1 5 5 5 6 
Format: 


XOR.OB vd, vs, vt 


Description: 


734 


Each element of vector vs is combined with the corresponding element of vector 
vt in a bitwise logical XOR operation. The sel field selects the values of vz[] used 
for each i. 


No data-dependent exceptions are possible. The operands must be values in the 
OB format. If they are not, the results are undefined and the values of the operand 


vectors become undefined. The result of this instruction is undefined if the 
processor is executing in 16 FP register mode. 
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XOR.OB pect aet XOR.OB 


(continued) 


Operation: 


ts <— FPR[vs] 
tt — select(sel, vt) 
FPR[vd] < XorOB(ts¢3._56, ttes..56) 
|| XorOB(ts55..4, tts5,.48) 
|| XorOB(ts47..40, tt47..40) 
|| XorOB(tsgg..32, ttzg9..32) 
|| XorOB(tsg1 24, ttg1..24) 
|| XorOB(tSa3. 46, tto3. 16) 
|| XorOB(ts15,_ , tty5..8) 
|| XorOB(ts7, 0; tt7_.0) 
function XorOB(ts, tt) 
XorOB < (0 || ts) xor (0 || tt) 
end XorOB 


Exceptions: 


Coprocessor Unusable exception 
Reserved Instruction exception 


19.4 Multimedia Instruction Opcode Bit Encoding 


Figure 19-6 lists the bit encoding for multimedia instructions. 
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Function (for Opcode = COP2) 
bits 2...0 
53 0 1 2 3 4 5 6 7 
0 C.EQ PICKF | PICKT C.LT C.LE MIN MAX 
1 SUB ADD AND XOR OR NOR 
2 SLL SRL 
3 ALNI SHFL 
4 RZU 
5 
6 MUL MULS{,L}) MUL{A,L} 
7 WAC{H,L}/RAC{H,L} 


Figure 19-6 Bit Encoding for Multimedia Instructions 
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20 


This chapter describes the VR5432 processor’s debug and test functions, which 
are intended for the exclusive use of debug software and hardware tools. These 
functions do not involve the WatchLo and WatchHi registers; instead, they replace 
and greatly improve on the debug functions implemented by the WatchLo and 
WatchHi registers. 


The debug and other JTAG-accessible registers described here are not 
architecturally visible parts of the processor. Programs running in Normal mode 
(User, Supervisor, or Kernel mode) cannot access the debug resources directly. 
However, a special Normal mode instruction, DBREAK, is provided for accessing 
Debug mode, and a debug tool attached to the JTAG port can access Debug mode 
directly. 
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20.1 Overview 


The processor implements both internally and externally accessible debug 
resources. The externally accessible resources are accessed via the JTAG 
interface, which complies with IEEE Standards 1149.1 and 1149.1a and 
implements N-Wire and N-Trace debug enhancements. 


The processor’s Debug mode is entered when a debug break occurs. The debug 
functions can be controlled internally or externally, as follows: 


¢ Internal Access. A processor-resident debugger program, invoked via 
the Debug Exception vector, uses debug instructions to access the 
processor’s debug registers. The DBREAK instruction is provided fo 
this purpose; when executed, it causes the processor to enter Debug 
mode. 


¢ External Access. An external debug tool, attached to the JTAG test 
access port (TAP), can access the processor's internal debug module, 
which includes JTAG-accessible registers that support JTAG, N-Wire, 
and N-Trace test interfaces. 


Figure 20-1 shows the processor resources accessible to debug programs and 
external debug tools using the internal- and external-access methods. 


VR5432 Processor 


Normal Mode Fea 
Resources Debug JTAG- se 


Registers 


Accessible 
Registers 


Internal-Access Resources 


External-Access Resources 


Figure 20-1 Access to Processor Resources in Debug Mode 
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The debug registers—DRO through DR15—can be accessed by the debug 
instructions—DBREAK, MTDR, MFDR, and DRET—1in either the internal- or 
external-access Debug mode. These registers and instructions give software or 
hardware the ability to break the processor, modify its state or set breakpoints, and 
resume running, or to break the processor, single-step, and resume running. 


The N-Wire and N-Trace interfaces, available in external-access Debug mode, 
support comprehensive hardware and software breakpoints and trace functions. 
They use a monitor mechanism that gives debug tools access to all system 
resources, including the processor’s user and debug registers, program counter, 
register file, caches, external memory, and I/O. For example, an external-access 
debug tool can download data via the JTAG port into external memory, return to 
normal operation mode, and monitor the result of execution using this data. The 
N-Wire functions provide run-time control and access to the processor’s internal 
state. The N-Trace functions support instruction-execution tracing via trace 
packets on the trace signals. Both functions share a set of JTAG-accessible 
registers. In the VR5432 implementation, “N” equals 4, as represented by the four 
TrcData [3:0] signals. 


Because an external-access debug tool can access both the debug registers and the 
JTAG-accessible registers, the external-access Debug mode provides more 
control of the processor than does internal-access Debug mode. 


20.2 Definition of Terms 


Debug Break. An event that causes the processor to asynchronously leave 
Normal mode (User, Supervisor, or Kernel mode) execution and enter Debug 
mode. The terms “break” and “debug break” are used interchangeably. Section 
20.3 defines all possible debug break events. 


Debug Exception Vector. Address OxFFFF FFFF BFCO 1000. The DBREAK 
instruction is designed for accessing this vector. 


Debug Instructions. DBREAK, MTDR, MFDR, and DRET, as described in 
Section 20.4.1. 


Debug Mode. The processor enters Debug mode as the result of a debug break. If 
the debug module is in reset at the debug break, the processor begins executing 

internal-access resident debugger instructions, starting at the Debug Exception 

vector address. If the debug module is not in reset at the debug break, the processor 
begins executing external-access instructions from the JTAG-accessible N- Wire 
Monitor Instruction (MON_INST) register. Although Debug mode can be entered 
and controlled via internal or external access, external access supports maximum 
control of the processor. See also Normal Mode, Debug Module, and Section 20.3. 
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Debug Mode Registers. The internally accessible debug registers and the JTAG- 
accessible registers. 


Debug Module. A module inside the processor that supports external access to the 
debug features via the JTAG port. The debug module contains the JTAG, N-Wire, 
and N-Trace interfaces. See Figure 20-1. 


Debug Module Reset. The processor state in which external access to the 
processor’s debug module is disabled. This reset is unrelated to the processor reset 
(Reset*). The debug module is enabled and disabled with the D/NIT bit in the 
JTAG-accessible N-Wire Debug Module System (DM_SYSTEM) register. In the 
internal-access method, the debug module is disabled (in reset), and the processor 
can enter Debug mode by executing the DBREAK instruction. In the external- 
access method, the debug module is enabled (not in reset). Compare to Debug 
Reset. 


Debug Registers. The registers accessible with the debug instructions. These 
registers include DRO through DR15 (DRCNTL, DEPC, DDATAO, DDATA1, 
IBC, DBC, IBA, IBAM, DBA, DBAM, DBD, and DBDM). All of the debug 
registers are accessible directly in the internal-access Debug mode, and they are 
accessible either directly or indirectly in the external-access Debug mode. The 
debug registers are described in Section 20.4.2. These registers overlap (share 
registers or copy register bits) with the JTAG-accessible registers described in 
Section 20.5.2. 


Debug Reset. A reset to the processor from the debug module, accomplished by 
externally setting the RESET bit in the N-Wire Debug Module Control 
(DM_CONTROL) register. The effect of a debug reset on the processor is the 
same as asserting Reset*. Compare to Debug Module Reset. 


External Access. Debug access to processor resources and operations by an 
external debug tool through the JTAG port and the on-chip debug module, which 
supports JTAG, N-Wire, and N-Trace debug functions (see Figure 20-1). In the 
external-access method, a debug tool can access all of the debug registers and all 
of the JTAG-accessible registers. External access thus provides more control of 
processor resources than does internal access. 


Hardware Breakpoint. An instruction address, data address, or data-data 
breakpoint specified in the debug registers or the JTAG-accessible registers. The 
hardware breakpoint registers are shared between the debug and JTAG-accessible 
register sets. 
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Internal Access. Debug access to processor resources and operations via a 
resident debugger program invoked at the processor’s Debug Exception vector 
address (the DBREAK instruction is provided for this purpose). The resident 
debugger program can use the debug instructions MTDR and MFDR to access the 
processor state, set breakpoints, and single-step. 


JTAG-Accessible Registers. The registers accessible in external-access Debug 
mode. They include the three required JTAG registers (Instruction, Bypass, and 
Boundary Scan), plus registers to support the N-Wire and N-Trace debug 
functions (DM_SYSTEM, DM_CONTROL, MON_INST, MON_DATA, 
TRCSYS, and most of the internal-access debug registers). These registe rsare 
described in Section 20.5.2. 


Monitor. A JTAG-accessible mechanism for accessing all system resources, 
including the processor’s Normal mode and Debug mode registers, cache, 
external memory, and I/O. 


Normal Mode. User, Supervisor, or Kernel mode. The processor is also in 
Normal mode when it is in Reset or is being reset by the debug module. See also 
Debug Mode. 


Resident Debugger. An optional program that can be accessed internally via the 
Debug Exception vector (the DBREAK instruction is designed for this purpose). 
This program provides system access to most (but not all) of the processor’s debug 
features when there is no attached debug tool or the debug tool is in Reset. 


Trigger. The BkTgIO* output signal. 


Trigger Event. An event that causes assertion of the BkTgIO* output signal. Such 
events can include: 


e An enabled hardware breakpoint 


e An enabled debug break 
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20.3 Debug Mode 


The processor enters Debug mode as a result of one of the following possible 
debug break events: 


e Internal-access debug break events 
- Execution of the DBREAK instructio 
- Setting th STEP bit in the DRCNTL debug register (DRO) 


- Reaching an instruction-address, data-address, or data-data 
breakpoint specified in the debug registers 


e External-access debug break events 
- Assertion o the BkTgIO* signal, when it is configured for inpu 


- Setting of th BREAK bit in the JTAG-accessible N-Wire Debug 
Module Control (DM_CONTROL) register 


- Setting th STEP bit in either the DRCNTL register or th 
DM_CONTROL register 


- Reaching an instruction-address, data-address, or data-data 
breakpoint specified in the debug registers 


Debug mode is entered regardless of the state of the debug module. 


e Ifthe debug modul is in Reset (DINIT bit set to 1 in the N-Wire 
DM_SYSTEM register), the processor begins executing internal- 
access resident debugger instructions starting at the Debug Exception 
vector address. In this case, the DRCNTL register controls Debug 
mode operations. 


¢ If the debug modul is not in Reset (DINIT bit cleared to 0 in the N- 
Wire DM_SYSTEM register), the processor begins executing 
external-access instructions from the N-Wire Monitor Instruction 
(MON_INST) register, if execution is enabled. In this case, the JTAG 
port controls Debug mode operations. 


When Debug mode is entered, all incomplete instructions are flushed from the 
pipeline, all outstanding external bus transactions are completed, execution 
transitions to Debug mode at an instruction boundary, the program counter (PC) 
is saved in the DEPC debug register, and execution is redirected to the 64-bit 
Debug Exception vector (location OxFFFF FFFF BFCO 1000). There may be a 
delay entering Debug mode to allow the pipeline flush and to allow all outstanding 
external transactions to complete; if so, the processor stalls during this time. 
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The processor will not enter Debug mode at a branch delay slot instruction 
boundary. Instead it stops at the branch instruction or the target of the branch. If a 
software or hardware breakpoint occurs for the branch delay slot instruction, the 
breakpoint occurs at the corresponding Branch instruction. If a single-step break 
is executed on a Branch instruction, both the branch and its delay slot are 
executed. 


Instructions that redirect the PC (e.g., branches) are not allowed to be executed in 
the MON_INST register when the debug module is in reset. Any attempt to do so 
results in undefined behavior. Instructions that redirect the PC are allowed if the 
debug module is not in reset. 


While in Debug mode, the processor behaves as if it is in Kernel mode (CPO Status 
EXL = 1), although entering Debug mode does not set the EXL bit. All interrupts 
are disabled, including NMI*, and any debug break events are ignored. If a Load 
or Store instruction causes an exception in Debug mode, the exception is 
processed as if the processor is in Kernel mode. The DM_EXCEPT bit in the 
relevant Debug Control register (DRCNTL for internal access, or 
DM_CONTROL for external access) indicates whether an exception occurred. If 
any instruction other than Load or Store causes an exception, the results and 
processor state are undefined. 


The processor returns to Normal mode from Debug mode by executing a DRET 
instruction. The processor vectors the PC to the address in the Debug Exception 
PC (DEPC) register. 


20.4 Internal Access 


In the processor’s internal-access Debug mode, a resident debugger program can 
use the debug instructions to access all of the debug registers. These instructions 
and registers are (with a few exceptions) also available to external-access Debug 
mode, as described in Section 20.5. 
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20.4.1 


20.4.1.1 


Debug Instructions 


The DBREAK, DRET, MTDR and MFDR instructions are unique to the 
processor’s debug features. Except for DBREAK, these instructions are accessible 
only when the processor is in Debug mode; executing them in Normal mode 
causes a Reserved Instruction trap. 


DBREAK: Debug Break 


31 26 25 6 5 0 


SPECIAL2 
011100 


0 DBREAK 
0000 0000 0000 0000 0000 411111 


6 


20 6 


20.4.1.2 


The DBREAK instruction forces entry into Debug mode by causing a trap to the 
Debug Exception vector address (OxFFFF FFFF BFCO 1000). This instruction 
may only be executed in Normal (User, Supervisor, or Kernel) mode. Execution 
in Debug mode results in undefined behavior. 


DRET: Debug Return 


31 26 25 6 5 0 


SPECIAL2 
011100 


0 DRET 
0000 0000 0000 0000 0000 411710 


6 


20 6 
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The DRET instruction returns from Debug mode to the mode in effect (User, 
Supervisor, or Kernel mode) when the last debug break occurred. Control is 
passed to the instruction pointed to by the Debug Exception PC (DEPC) register. 
Unlike most jumps and branches, the execution of which also executes the next 
instruction (the one in the delay slot), DRET does not execute a delay slot 
instruction. The DRET instruction must not be placed in a branch delay slot. 
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20.4.1.3 MTDR: Move to Debug Register 
31 26 25 21 20 16 15 11 10 65 
SPECIAL2 MTDR rt dr 0 Debug Move 
011100 | 00100 00000 111101 
6 5 5 5 5 6 
This instruction moves the contents of general register rf into debug register dr. 
20.4.1.4 MFDR: Move from Debug Register 
31 26 25 21 20 16 15 11 10 65 
SPECIAL2 MFDR rt dr 0 Debug Move 
011100 | 00000 00000 111101 
6 5 5 5 5 6 
This instruction moves the contents of debug register dr into general register rt. 
20.4.2 Debug Registers 


Table 20-1 lists the debug registers. The DME bit in the CPO Status register is only 
accessible in Normal mode via Normal mode instructions. All of the debug 
registers except the DME bit are accessible in both the internal-access and 
external-access Debug modes via the MFDR and MTDR instructions. Unless 
otherwise specified, the contents of the debug registers are undefined after a 
processor cold reset. 
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Table 20-1 Debug Registers 


‘ ; Register ‘ 
Register Register Name Register Width Restore 
Mnemonic Number ‘ Membership 
(Bits) 
DME DME bit in the CPO Status register —_ 1 Internal 
DRCNTL | Debug Register Control register DRO 32 Internal 
DEPC Debug Exception PC register DR1 64 Internal 
DDATAO Debug Data Monitor 0 and Monitor Data DR? 64 Internal and 
register external 
DDATAI1 | Debug Data Monitor | register DR3 64 Internal 
IBC Instruction Breakpoint Control/Status DR4 32 Internal and 
register external 
DBC Data Breakpoint Control/Status register DRS5 32 Palernene 
external 
— Reserved DR6 — 
— Reserved DR7 — 
IBA Instruction Breakpoint Address register DR8 642 Piternatane 
external 
IBAM Instruction Breakpoint Address Mask DRO 642 Internal and 
register external 
— Reserved DRI1O — 
—_ Reserved DRI1 — 
DBA Data Breakpoint Address register DR12 64 stir cs 
external 
DBAM Data Breakpoint Address Mask register DR13 64 Pieraane 
external 
DBD Data Breakpoint Data register DR14 64 Piearane 
external 
DBDM _|Data Breakpoint Data Mask register DRIS | 64 ene 
external 
Notes: 


1. All debug registers except DME are accessible in both the internal-access and external-access Debug 
modes via the MFDR and MTDR instructions. However, the registers marked “internal and external” are 
actually shared by the internal-access and external-access register sets. 

2. Only 40 bits of the virtual address, plus the region bits (63:62), are compared. The unused address bits 
must be sign extended to bit 61 for all address spaces, except xkphys. For xkphys address space, bits 
61:59 must also indicate the correct cacheability attribute, because these bits are compared. 


746 


Vr5432 Microprocessor User’s Manual 


Debug and Test Features 


20.4.2.1 Debug Mode Enable (DME) bit in the CPO Status register 


Bit 24 is the Debug Mode Enable (DME) bit in the Diagnostic Status (DS) field of 
the CPO Exception Processing Status register (see Section 6.2.5 on page 97). It 
indicates to the processor that there is a resident debugger program at the Debug 
Exception vector. The bit is only accessible in Normal mode via Normal mode 
instructions, and it is only meaningful when the debug module is in reset. 


¢ DME = 0: A debug break event does not cause the processor to enter 
Debug mode. The DBREAK instruction causes a Reserved 
Instruction exception instead of a debug break 


¢ DME = 1: A debug break event causes the processor to enter Debug 
mode. 


20.4.2.2 DRCNTL: Debug Register Control register (DRO) 


The DRCNTL register is accessible only to internal-access resident debugger 
programs. It duplicates a subset of bits from two external-access registers that 
constitute part of the N-Wire interface—the Debug Module System 
(DM_SYSTEM) register and the Debug Module Control (DM_CONTROL) 
register, described in Section 20.5.2.5 and Section 20.5.2.6. 


Although the DRCNTL bits duplicate some of the DM_SYSTEM and 
DM_CONTROL bits, the DRCNTL bits are a separate set of bits; they are not 
shared by the DM_SYSTEM and DM_CONTROL registers. Either the DRCNTL 
register is active or the two external-access registers are active; use of these 
registers is mutually exclusive. DRCNTL is used when the debug module is in 
reset (i.e., for internal access). The two external-access registers are used when the 
debug module is not in reset. The DIJNIT bit in the DM_SYSTEM register 
determines whether the debug module is in reset. 


Figure 20-2 shows the register format. Table 20-2 describes the register fields. 


31 0 


See table below for field descriptions | 


Figure 20-2 Debug Register Control (DRCNTL) Register Format 
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Table 20-2 Debug Register Control Register (DRCNTL) Fields 


Bits Field Description 
1:0 Reserved — 


Mask User Reset in Debug mode 
1 > Ignores Reset* input while in Debug mode 
0 — Accepts Reset* input while in Debug mode 
Defaulted to 1 at the debug module initialization. 


Mask User NMI* 
1 > Ignores NMI* 
0 — Accepts NMI* 
Defaulted to 0 at the debug module initialization. 


2 MRST 


3 MNMI 


Mask User Interrupts 

1 > Ignores user interrupt input 

0 — Accepts user interrupt input 
Defaulted to 0 at the debug module initialization. MINT effects 
interrupts via the Int signals or via an external write. Software 
interrupts are not masked. 


Single-Step Break 
Single-step allows the user to execute one Normal mode instruction. 
Single-step occurs after a DRET instruction. The processor returns to 
Normal mode, executes a single instruction, and breaks back into 
Debug mode. 

1 > Enables single-step break (single-step mode) 

0 — Disables single-step break 
Defaulted to 0 at the debug module initialization. 


Break Cause 
This consists of multiple bits. One bit is assigned for each break cause 
and a corresponding bit is set when the break occurred. Multiple bits 
are set if the break occurred by multiple break causes. The bit 
assignments are defined as follows: 
Bit 6 — External break 
13:6 BRK_CAUSE Bit 7 — Single-step 
Bit 8 — Software breakpoint 
Bit 9 > Reserved 
Bit 10 > Reserved 
Bit 11 — Instruction-address breakpoint 
Bit 12 — Data access (address or data) breakpoint 
Bit 13 > Reserved 


4 MINT 


®) STEP 
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Table 20-2 Debug Register Control Register (DRCNTL) Fields (continued) 


Bits 


Field 


Description 


14 


DBM 


Debug Mode 

Indicates Debug mode or Normal mode (read only). 
1 > Normal mode (User, Supervisor, or Kernel mode) 
0 — Debug mode 


17:15 


CPU_STAT 


Processor Status 

000 — Reset (highest) 
00 1 > Reserved 

0 10 —- Reserved 

011 — Reserved 

100 > Reserved 

10 1 — Reserved 

1 10—- Reserved 

1 11 — Normal (lowest) 


20:18 


Reserved 


21 


DM_EXCEPT 


Debug Mode Exception 
Indicates that an exception occurred while in Debug mode (read/ 
write). 
Read 1 — Instruction executed in Debug mode caused exception 
Read 0 — No exception in Debug mode since flag was cleared 
Write 1 — No operation 
Write 0 — Clear exception flag 
If any instruction other than Load or Store causes an exception, the 
results and processor state are undefined. 


22 


BKIODIR 


BkTglIO* direction 
Indicates the direction of the BkTgIO* signal. 
1 > Input 
0 — Output 
Defaulted to 1 at the debug module initialization. 


23 


BKIOEN 


BkTgIO* Break Enable 
1 — Enable driving of BkTgIO* trigger output at a debug break 
event, or to break the processor at a BkTgIO* break input 
0 — Disable 

Defaulted to 0 at the debug module initialization. 


24 


BKIOTEN 


BkTgIO* Trigger Enable (read/write) 
1 — Enable detected internal trigger events to the BkTgIO* signal 
when it is configured in the output direction 
0 — Disable 

Defaulted to 0 at the debug module initialization. 


31:25 
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20.4.2.3 DEPC: Debug Exception PC (DR1) 


When entering Debug mode, the DEPC register contains the virtual address of the 
instruction where the debug break occurred. This is the address at which Normal 
mode instruction processing may resume after exiting Debug mode. Figure 20-3 
shows the register format. 


63 0 


DEPC | 


Figure 20-3 Debug Exception PC (DEPC) Register Format 


20.4.2.4 DDATAO: Debug Data Monitor 0 and Monitor Data (DR2) 


The DDATAO register and the JTAG-accessible Monitor Data (MON_DATA) 
register (see Section 20.5.2.8) are the same register. The register is used for 
external access when the debug module is active, and therefore is scannable. It can 
also be used as a scratch register in Debug mode. The user is responsible for 
ensuring that the types of use for the register do not overlap. Figure 20-4 shows 
the register format. 


63 0 


DDATAO | 


Figure 20-4 Debug Data Monitor 0 (DDATAO) Register Format 
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20.4.2.5 DDATALI: Debug Data Monitor | (DR3) 


The DDATAI register can be used as a stack pointer or scratch register. Figure 
20-5 shows the register format. 


63 0 


DDATA1 | 


Figure 20-5 Debug Data Monitor 1 (DDATA1) Register Format 


20.4.2.6 Instruction Address Breakpoint 


Instruction address hardware breakpoints are supported by three registers: IBC, 
IBA, and IBAM. These three registers are used in both internal-access and 
external-access Debug mode. 


To determine an instruction address match, the program counter is compared with 
the Breakpoint instruction address before TLB translation. If the breakpoint 
condition is met and the break is enabled in the IBC register, the processor enters 
Debug mode. The instruction that caused the breakpoint is not executed. 


The VR5432 implementation of instruction address breakpoints has the following 
limitations: 


¢ Only doubleword addresses can be compared (IBAM[2:0] must be 
111,) for instruction address breakpoints. 


¢ Triggers (BkTgIO* trigger output) are not supported for instructio 
address breakpoints 


¢ Only 40-bit virtual addresses are supported for instruction address 
breakpoints. 


The following registers are used to set an instruction address breakpoint. 
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IBC: Instruction Breakpoint Control/Status register (DR4) 


The IBC register is the control and status register for the instruction-address 
breakpoint. Figure 20-6 shows the register format. Table 20-3 describes the 


register fields. 


31 


0 


See table below for field descriptions 


Figure 20-6 Instruction Breakpoint Control/Status (IBC) Register Format 


Table 20-3 Instruction Breakpoint Control/Status (IBC) Register Fields 


Bits Name Description 
Breakpoint Status 
0 BS 1 > Breakpoint match occurred. 
0 — Breakpoint match did not occur. 
Cleared to 0 on cold reset. 
Break Enable. Causes a debug break when a breakpoint match 
occurs. 
1 BE 1 > Enabled. 
0 — Disabled. 
Cleared to 0 on cold reset. 
2 Reserved —= 
Invert address match condition 
3 INV 1 > Address matches when conditions don’t match. 
0 — Address matches when conditions match. 
ASID compare mask 
4 ASIDM 1 — Address match is not qualified with ASID matching. 
0 — Address match is qualified with ASID matching the 
current processor ASID. 
12:5 ASID Address Space ID to compare. 
31:13 Reserved —_ 
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IBA: Instruction Breakpoint Address register (DR8) 


The IBA register contains the address of the instruction breakpoint. When the 
instruction stored at the specified address is being executed, the condition is met. 
Figure 20-7 shows the register format. 


Even though a 64-bit IBA register is specified, only 40 bits of the virtual address, 
plus the region bits (63:62) are compared. The unused address bits must be sign 
extended to bit 61 for all address spaces except xkphys. For xkphys address space, 
bits 61:59 must also indicate the correct cacheability attribute, because these bits 
are compared. Please refer to the memory mapping and address space discussions 
in Chapter 4. 


63 62 61 40 39 0 


region see text 40-bit virtual address 


Figure 20-7 Instruction Breakpoint Address (IBA) Register Format 


IBAM: Instruction Breakpoint Address Mask register (DR9) 


The IBAM register contains the mask for IBA. If a bit of this register is 1, the 
corresponding bit of IBA is not compared. Figure 20-8 shows the register format. 


As with the IBA register, even though a 64-bit IBAM register is specified, only 40 
bits of the virtual address, plus the region bits (63:62), are compared. The unused 
address bits must be sign extended to bit 61 for all address spaces except for 
xkphys. For xkphys address space, bits 61:59 must also indicate the correct 
cacheability attribute, because these bits are compared. 


63 62 61 40 39 0 


region see text 40-bit virtual address mask 


Figure 20-8 Instruction Breakpoint Address Mask (IBAM) Register Format 
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20.4.2.7 
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Data Access Breakpoint 


Data access hardware breakpoints (break on address, break on data, or break on 
both) are supported by five registers: DBC, DBA, DBAM, DBD, and DBDM. 
These five registers are used in both internal-access and external-access Debug 
mode. 


To determine a data instruction address match, the program counter is compared 
with the breakpoint instruction address before TLB translation. If the breakpoint 
condition is met and the break is enabled, the processor enters Debug mode. If 
only a data address condition is specified, the instruction that caused the 
breakpoint is not executed. If a data access condition (load or store) is specified in 
the DBC register, the break occurs sometime after the instruction that caused the 
breakpoint. If the breakpoint condition is met and the trigger is enabled in the 
DBC register, the processor asserts a trigger on BkTgIO* output. 


The VR5432 implementation of data access breakpoints has the following 
limitations and features: 


e For data access store breakpoints, only doubleword addresses can be 
compared (IBAM[2:0] must be 111,). 


¢ For data access load breakpoints, data access sizes other than 64 bits 
are supported. 


¢ Only 40-bit virtual addresses are supported for data access 
breakpoints. 


The processor supports data access sizes other than 64 bits. For loads, the DBDM 
register must mask all bits that are not part of the data access size, or the DBD 
register must specify the proper sign-extended 64-bit value. For stores, only data 
access for doublewords is supported. 


The following registers are used to set a data access breakpoint. 
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DBC: Data Breakpoint Control/Status register (DR5) 


The DBC register provides control and status for the data address and data access 
breakpoints. Figure 20-9 shows the register format. Table 20-4 describes the 
register fields. 


31 0 


See table below for field descriptions | 


Figure 20-9 Data Breakpoint Control/Status (DBC) Register Format 


Table 20-4 Data Breakpoint Control/Status (DBC) Register Fields 


Bits 


Name Description 


Breakpoint Status 

1 > Breakpoint match occurred 

0 — Breakpoint match did not occur 
Cleared to 0 on cold reset. 


BS 


Break Enable at Address Match 

Causes a debug break when the address match condition is met. 
1 > Enabled 
0 — Disabled 


BEA 


Trigger Enable at Address Match 

Outputs a trigger on BkTgIO* when the address match 
TEA condition is met. 

1 — Enabled 

0 — Disabled 


Invert Address-Match Condition 
AINV 1 — Address matches when conditions don’t match 
0 — Address matches when conditions match 


ASID Compare Mask 
1 — Address match is not qualified with ASID matching 
0 — Address match is qualified with ASID matching the 
current processor ASID 


ASIDM 


ASID Address Space ID to compare 


Reserved = 
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Table 20-4 Data Breakpoint Control/Status (DBC) Register Fields (continued) 


Bits 


Name 


Description 


16 


TS 


Trigger Status 
1 > Trigger occurred 
0 — Trigger has not occurred 


17 


BED 


Break Enable at Data Match 
Causes a debug break when the data condition is met. 

1 > Enabled 

0 > Disabled 
BEA and BED are in effect if either BERD or BEWR are set. 
In this case, if both BEA and BED are set, a break occurs only 
when both conditions are met. If both are cleared, a break 
occurs regardless of the compare results. 


18 


TED 


Trigger Enable at Data Match 
Outputs a trigger on BkTgIO* when the data match condition 
is met. 

1 — Enabled 

0 — Disabled 
TEA and TED are in effect if either TERD or TEWR is set. In 
this case, if both TEA and TED are set, a trigger is output when 
both conditions are met. If both are cleared, a trigger is output 
regardless of the compare results. 


19 


DINV 


Invert Data Match Condition 
1 > A data match occurs when conditions don’t match 
0 — A data match occurs when conditions match 


20 


BERD 


Break Enable for Read Access (i.e., Loads) 
1 > Break enabled for loads 
0 —> No break enabled for loads 
Cleared to 0 on cold reset. 


21 
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BEWR 


Break Enable for Write Access (i.e., Stores) 

1 > Break enabled for stores 

0 > No break enabled for stores 
Cleared to 0 on cold reset. If neither BERD nor BEWR are set, 
no data access debug break occurs. These are the primary break 
enable bits. 
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Table 20-4 Data Breakpoint Control/Status (DBC) Register Fields (continued) 


Bits 


Name 


Description 


22 


TER 


Trigger Enable for Read Access (i.e., Loads) 
1 > Trigger enabled for loads 
0 > No trigger enabled for loads 
Cleared to 0 on cold reset. 


23 


TEWR 


Trigger Enable for Write Access (i.e., Stores) 

1 > Trigger enabled for stores 

0 — No trigger enabled for stores 
Cleared to 0 on cold reset. If neither TERD nor TEWR is set, 
no data access trigger occurs. These are the primary trigger- 
enable bits. 


31:24 


Vr5432 Microprocessor User’s Manual 


Reserved 


DBA: Data Breakpoint Address register (DR12) 


The DBA register contains the address of the data breakpoint. When the 
instruction stored in the specified address is being executed, the condition is met. 
Figure 20-10 shows the register format. 


Even though a 64-bit DBA register is specified, only 40 bits of the virtual address, 
plus the region bits (63:62) are compared. The unused address bits must be sign- 
extended to bit 61 for all address spaces except for xkphys. For xkphys address 
space, bits 61:59 must also indicate the correct cacheability attribute, because 
these bits are compared. Please refer to the memory mapping and address space 
discussions in Chapter 4. 


63 62 61 


40 39 0 


region 


see text 40-bit virtual address 


Figure 20-10 Data Breakpoint Address (DBA) Register Format 
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DBAM: Data Breakpoint Address Mask register (DR13) 


The DBAM register contains the bit mask for DBA. If a bit of this register is 1, the 
corresponding bit of DBA is not compared. Figure 20-11 shows the register 
format. 


As with the DBA register, even though a 64-bit DBAM register is specified, only 
40 bits of the virtual address, plus the region bits (63:62), are compared. The 
unused address bits must be sign extended to bit 61 for all address spaces except 
for xkphys. For xkphys address space, bits 61:59 must also indicate the correct 
cacheability attribute, because these bits are compared. 


63 62 61 40 39 0 


region see text 40-bit virtual address mask 


Figure 20-11 Data Breakpoint Address Mask (DBAM) Register Format 


DBD: Data Breakpoint Data register (DR14) 


The DBD register contains the data of the data breakpoint. The break condition is 
met when this data is read or written. Figure 20-12 shows the register format. 


63 0 


Figure 20-12 Data Breakpoint Data (DBD) Register Format 
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DBDM: Data Breakpoint Data Mask register (DR15) 


The DBDM register contains the bit mask for DBD. If a bit of this register is 1, the 
corresponding bit of DBD is not compared. For partial word or partial doubleword 
operations, the unused bits must be masked. Figure 20-13 shows the register 
format. 


63 0 


DBDM | 


Figure 20-13 Data Breakpoint Data Mask (DBDM) Register Format 


20.5 External Access 


In the processor’s external-access Debug mode, an external debug tool controls 
processor operations through the JTAG test access port (TAP). The JTAG port 
supports not only JTAG testing but also N-Wire and N-Trace testing. In external 
access, the debug tool can access all Debug mode registers, including the debug 
registers (Section 20.4.2) available to an internal-access resident debugger 
program and the JTAG, N-Wire, and N-Trace registers described below. This 
access to debug resources gives external-access Debug mode more control of the 
processor than does internal-access Debug mode. 
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20.5.1 JTAG Port Signals 


20.5.1.1 Signal summary 


Table 20-5 JTAG Test Access Port Signal 


Name Definition 


Direction 


Description 


JTAG Test Clock 


JTCK : 
input 


Input 


The processor accepts a serial clock on the 
JTCK input. At the rising edge of JTCK, both 
JTDI and JTMS are sampled. The maximum 
frequency of JTCK is 33 MHz, and it runs 
asynchronously to the processor clock, 
SysClock. The ratio of SysClock to JTCK 
must be at least 4:1 for proper N-Wire and N- 
Trace synchronization. 


JTAG Test Mode 


ATS Select 


Input 


JTDI JTAG Test DataIn | Input 


The JTAG command signal. It is decoded by 
the TAP controller to control test operations. 
The signal has an internal pull-up so that its 
level is High when the debug tool is not 
connected. 


Data is serially scanned in through this signal. 
The signal has an internal pull-up so that its 
level is High when the debug tool is not 
connected. 


JTDO JTAG Test Data Out | Output 


Data is serially scanned out through this signal 
on the falling edge of JTCK. Per the IEEE- 
1149.1 standard, the JTDO output is tristated 
unless data is actively being scanned. 


TrcData (3:0) | N-Trace Data Port Output 


This bus is used for output of all trace packets 
generated as a result of processor execution. 
Trace packets can consist of one or more clock 
cycles of data on this bus. 


TrcEnd N-Trace End Output 


760 


Assertion of this signal indicates the end of a 
trace packet on the TrcData (3:0) bus. 
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Table 20-5 JTAG Test Access Port Signals (continued) 


Name 


Definition 


Direction 


Description 


TreClk 


N-Trace Clock 


Output 


This clock is generated for the benefit of test 
equipment that requires a clock reference for 
trace information. It runs at the same 
frequency as SysClock. 


RMode*, 
BkTgIO* 


N-Wire Reset Mode, 
or 

N-Wire Break or 
Trigger I/O 


Input/ 
output 


This pin supports two N-Wire signals: debug 
reset (RMode*), and debug break or trigger 
(BkTgIO*). During assertion of ColdReset*, 
the pin carries the RMode* input signal. In all 
other states the pin carries the BkTgIO* debug 
break input or debug trigger output signal, 
depending on its setup in various debug 
registers (Section 20.4.2) and JTAG- 
accessible registers (Section 20.5.2). The pin 
operates at SysClock frequency and must be 
driven synchronously with SysClock. The pin 
has an internal pull-up so that its level is High 
when the debug tool is not connected. 

See Section 20.5.1.2 and Section 20.5.1.3 for 
details. 


Tristate 


Tristate Outputs 


Input 


This signal floats all processor outputs to 
allow isolation for board-level tests. 


20.5.1.2 


Reset mode (RMode*) signal 


When ColdReset* is deasserted, the RMode* input is sampled to set the value of 
the RESET bit in the Debug Module Control (DM_CONTROL) register, which is 
the N-Wire debug reset variable. The RMode* value initializes the RESET bit as 


follows: 


¢« Low. Enables debug reset (when RMode* is sampled low, the RESET 
bit is set to 1). The debug module asserts debug reset to th 
processor. The effect on the processor of asserting RMode* is the 
same as asserting Reset*. 


¢ High. Disables debug reset. The debug module does not assert debu 
reset to the processor. 
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20;91.3 


Break or Trigger I/O (BkTgIO*) signal 


762 


After ColdReset* is deasserted, BkTgIO* acts as a debug break input or a debug 
trigger output. The direction of the BkTgIO* signal defaults to input at debug 
module initialization, but its direction can thereafter be configured in the JTAG- 
accessible DM_SYSTEM register (see Section 20.5.2.5). 


If the signal is configured for output, it can be enabled to act as a trigger to an 
external debug tool, or it indicates whether the processor is currently in Debug 
mode: 


¢ Low (1 cycle pulse) The debug module has detected one or more 
processor internal trigger events 


* Low (> 1 cycle). The processor is in Debug mode 


¢ High. The processor is operating in Normal mode (User, Supervisor, 
or Kernel mode). 


Since the processor is a superscalar core running at a higher frequency than the 
system interface, trigger events can occur much faster than BkTgIO* can report 
them. Trigger events can be reported at the maximum rate of one every two 
SysClock cycles (1 cycle pulse). All trigger events that have occurred since the 
last BkTgIO* trigger output are reported in one trigger. If the processor enters 
Debug mode, any trigger events that have not been reported will not be reported. 


If the signal is configured for input, it acts as a debug break from an external debug 
tool that can force the processor from Normal mode (User, Supervisor, or Kernel 
mode) to Debug mode: 


e« Low. Break request, forces processor into Debug mode 


¢ High. Maintain current processor mod 


The debug break request needs to be asserted for only one cycle. The processor 
enters Debug mode as soon as it is conveniently possible. If the processor is 
already in Debug mode, or if there is already an outstanding debug break request, 
a subsequent debug break request has no effect. 
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Board connector for debug tool 


System designers are encouraged to incorporate into their board design a 26-pin 
high-density connector that provides 13 signals and 13 grounds. This assures 
maximum performance and eliminates noise problems. The target connector is a 
0.05”-pitch 26-pin header connector, Samtec part number FTSH-113-01-L-D 
(through-hole) or FTSH-113-01-L-DV (surface mount), or equivalent. The 26 
pins are allocated to 12 signals (and one spare) and 13 grounds. The connector 
spacing is a convenient 0.05” x 0.05” and provides easy cabling to external 
equipment. 


Alternatively, there is a 10-pin connector option. This smaller connector contains 
only the basic JTAG boundary scan TAP signals and excludes the real-time trace- 
related signals. 


Figure 20-14 shows the two connectors. Table 20-6 and Table 20-7 list their 
pinouts. 
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Figure 20-14 JTAG Connector Types 
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Table 20-6 26-Pin JTAG Connector Signals 


Table 20-7 10-Pin JTAG Connector Sign a1 


Pin Signal 110) Target Termination 
1 Reserved _— _— 
3 JTDI Input 1-KOhm pull-up resistor 
5 JTDO Output 33-ohm series resistor 
7 JTMS Input 1-KOhm pull-up resistor 
9 JTCK Input 1-KOhm pull-up resistor 
11 Tristate Input 1-KOhm pull-down resistor 
13 See Input/Output 1-KOhm pull-up resistor 
15 TrcData 0 Output 33-ohm series resistor 
17 TrcData | Output 33-ohm series resistor 
19 TreData 2 Output 33-ohm series resistor 
21 TrcData 3 Output 33-ohm series resistor 
23 TrcEnd Output 33-ohm series resistor 
25 TreClk Output 33-ohm series resistor 


Pin Signal 110) Target Termination 
1 Reserved — — 
3 JTDI Input 1-KOhm pull-up resistor 
5 JTDO Output 33-ohm series resistor 
7 JTMS Input 1-KOhm pull-up resistor 
9 JTCK Input 1-KOhm pull-up resistor 


In addition to the above debug port connector, system designers may also want to 
include a 208-pin PQFP test socket. The socket or connector should have the exact 
pinout, shape, and layout of the actual 208-pin PQFP processor chip and should 

be placed as close as possible to the processor chip. This extra socket or connector 
enables connection to a logic analyzer preprocessor between the target board and 
the processor without having to remove the processor from the board. The 
preprocessor can then support full visibility to all external processor signals, as 
well as real-time trace and inverse assembly. 
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20.5.2 
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JTAG-Accessible Registers 


Table 20-8 lists the registers accessible by a debug tool through the JTAG port. 


These registers support JTAG, N-Wire, and N-Trace functions. 


Table 20-8 JTAG-Accessible Regist er 


F ; Width 
Mnemonic Register Name (Bits) 

_— JTAG Instruction register 5 

— JTAG Bypass register 1 

— JTAG Boundary Scan register 109 

— Processor Type register 25 

DM_SYSTEM N-Wire Debug Module System register 7 

DM_CONTROL _ | N-Wire Debug Module Control register 22 

MON_INST N-Wire Monitor Instruction register 64 
N-Wire Monitor Data register 

MON_DATA This is the same as Debug register DR2 64 
(DDATAO). See Section 20.4.2.4. 

TRCSYS N-Trace System register 11 

IBC 

ee N-Wi d N-T: Hard Breakpoint 

IBAM -Wire and N-Trace Hardware Breakpoin 

DBC registers 

DBA These are the same as Debug Registers DR4— | Various 
DR15. See Section 20.4.2.6 and Section 

pee 20.4.2.7 

DBD 4.2.7. 

DBDM 
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20:5.2:1 JTAG Instruction register 
The JTAG Instruction register holds the opcodes for JTAG, N-Wire, and N-Trace 
operations. Instructions are entered into the test logic during an instruction register 
scan sequence in the TAP controller. Figure 20-15 shows the register format. 
Table 20-9 describes the JTAG, N-Wire, and N-Trace instructions. 
4 0 
Instruction 
5 
Figure 20-15 JTAG Instruction Register Format 
Table 20-9 JTAG Instructions 
Instruction Opcode Data Register Function 
EXTEST 00000 | JTAG Boundary Scan register ae errenity extemal ia he 
Allows a snapshot of the 
normal operation of the chip to 
be taken and examined. Also 
SAMPLE/ P allows data to be preloaded into 
PRELOAD QOOOL. (TEAS BOUnGaty Sean zeeiclee” || a caralleloupawor thc 
Boundary Scan register prior to 
another instruction such as 
EXTEST. 
DM_SYSTEM  |00010 N-Wire Debug Mode System Accesses the Debug Module 
register System register 
DM_CONTROL |00011 N-Wire Debug Mode Control Accesses the Debug Module 
register Control register 
PROCTYPE 00100 Processor Type register ne me Processor Ly pe 
register 
NTRACE_SYS |00101 | N-Trace System register Aer reoine Tite ysem 
register 
MON_INST 01000 N-Wire Monitor Instruction Accesses the Monitor 
register Instruction register 
MON_DATA [01001 _ | N-Wire Monitor Data register ssstes he Momton Date 
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Table 20-9 JTAG Instructions (continued) 


Instruction Opcode Data Register Function 
CACHE_TEST |01100 Cache Test register Enables Cache Test mode 
HIGHZ 01110 JTAG Bypass register Tristates all outputs of the chip 

Connects JTDI to JTDO 
BYPASS 11111 JTAG Bypass register through the |-bit Bypass 
register 


20,5.2.2 
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JTAG Bypass register 


The JTAG Bypass register is | bit wide. When the TAP controller is in the Shift- 
DR (Bypass) state, the data on the JTDI signal is shifted into the Bypass register, 
and the data on Bypass register output shifts to the JTDO output signal. 

Figure 20-16 shows the register format. 


0 


=) 


Figure 20-16 JTAG Bypass Register Format 


The Bypass register is like a short-circuit. It allows bypassing of board-level 
devices in the boundary scan chain that do not require a specific test. Use of the 
register speeds up access to Boundary Scan registers in those ICs that remain 
active in the board-level test data path. 
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20.5.2.3 JTAG Boundary Scan register 


The JTAG Boundary Scan register is a single bus comprising a 74-bit Shift 
register, each bit of which is connected to a processor signal. The Boundary Scan 
register retains states for all of the processor’s input and output signals, except for 
some clock and phase-locked loop signals. The external signals can be configured 
to drive any arbitrary pattern, depending on the data scanned into the Boundary 
Scan register while in the JTAG Shift-DR state. Data driven into the signals from 
other devices can be examined while in the Capture-DR state. 


Figure 20-17 shows the register format. Table 20-10 describes the register bits in 
their scan order. 


73 0 


SysADCO See table below for field descriptions jSysADEn 


74 


Figure 20-17 JTAG Boundary Scan Register Format 


The least-significant bit, jSysADEn, is the JTAG output enable bit for all 
processor outputs. Output is enabled when this bit is set to 1. The remaining 73 
bits correspond to the processor’s 73 signal pads, as shown in Table 20-10. The 
scan starts by shifting the least-significant bit out of the Boundary Scan register, 
so the first scan-out bit is the jSysADEn signal. 


Table 20-10 JTAG Boundary Scan Register Order 


No Signal No. | Signal | No. Signal No. Signal 

1 jSysADEn |20 |NMI* 39 | SysCmd7 58 | RdRdy* 

2 Tristate 21 |SysAD8 |40  |SysCmd6 59 SysAD30 

3 ColdReset* |22 |SysAD9 /|41 SysCmd5 60 ‘| ValidOut* 

4 BigEndian /23 |SysAD10 /42 | SysCmd4 61 SysAD31 

5 DivModeO /24 |SysAD11 | 43 SysCmd3 62 | PReq* 

6 DivModel |25 |SysAD12 |44 — | SysCmd2 63 SysADO 

7 ByPassPLL |26 |SysAD13 |45 | ValidIn* 64 |SysAD1 

8 TrcEnd 27 |SysAD14 |46 | OptionR43k* |65  |SysAD2 

9 TreData3 |28 |SysADI15 |47 | Reset* 66 |SysAD3 

10 |TrcData2 |29 |SysAD16 |48 SysCmd1 67 |SysAD4 
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Table 20-10 JTAG Boundary Scan Register Order (continued) 


No. Signal No. | Signal | No. Signal No. Signal 
11 | TreClk 30 |SysAD17 |49 — | SysCmd0O 68 |SysAD5 
12 | TrcDatal 31 |SysAD18 )50 | ExtRqst* 69 |SysAD6 
13. |TrcData0 |32 |SysADI19 |51 SysAd25 70 |SysAD7 
14 |BkTgIO* |33 |SysAD20 /52 | Release* 71 |SysADC3 
15 | Int4 34 |SysAD21 |53 | SysAD26 72. |SysADC2 
16 | Int3 35 |SysAD22 |54 | SysAD27 73 ~|SysADCI1 
17 ‘| Int2 36 |SysAD23 |55 | SysAD28 74. |SysADCO 
18 | Intl 37 |SysAD24 |56 | SysAD29 

19 | IntO 38 |SysCmd8 |57 | WrRdy* 


20.5.2.4 Processor Type register 


This register contains the CPU type and the debug module version. Figure 20-18 
shows the register format. Table 20-11 describes the register fields. 


24 16 15 0 


DMV PID 
9 16 


Figure 20-18 Processor Type Register Format 


Table 20-11 Processor Type Register Format Register Fields 


Bits Field Description 
15:0 DMV Debug module version. Set to 10H. 
24:16 PID Processor ID. Set to 5400H. 
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20.5.2.5 N-Wire Debug Module System register (DM_SYSTEM) 


The DM_SYSTEM register contains the basic configuration fields for debug 
module initialization, N-Wire RMode*/BkTgIO* signal functions, and N-Trace 
functions. Figure 20-19 shows the register format. Table 20-12 describes the 
register fields. Certain fields of this register are copied into the DRCNTL debug 
register, described in Section 20.4.2. 


6 0 
See table below for field descriptions | 
t 


Figure 20-19 N-Wire Debug Module System (DM_SYSTEM) Register Format 


Table 20-12 N-Wire Debug Module System (DM_SYSTEM) Register Fields 


Bits Name Description 


Initialize the Debug Module (read/write) 

1 > Resets (initializes) the debug module 

0 — Releases reset of debug module (enable debug module) 
0 DINIT Defaulted to 1 at processor ColdReset* or JTAG in reset. When 
DINIT = 1, all the N-Wire register bits are at their reset value. The 
N-Wire function cannot be loaded unless DINIT is enabled 
(DINIT = 0). 


RMode* /BkTgIO* Signal Implementation (read only) 
1 BKTGIO 1 — Implemented 
0 — Not implemented 


BkTgIO* Direction (read/write) 
1 > Input 
0 — Output 
Defaulted to 1 at the debug module initialization. 


BkTgIO* Break Enable (read/write) 
1 — Enable driving of trigger output on BkTgIO* at a 
3 BKIOBEN processor break, or to break the processor at a BkTgIO* input 
0 — Disable 
Defaulted to 0 at the debug module initialization. 


2 BKTGIODIR 
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Table 20-12 N-Wire Debug Module System (DM_SYSTEM) Register Fields (continued) 


Bits Name Description 


BkTgIO* Trigger Enable (read/write) 

1 — Enable detected internal trigger events to the BkTgIO* 
4 BKIOTEN signal when it is configured in the output direction 

0 — Disable 
Defaulted to 0 at the debug module initialization. 


N-Trace Implementation (read only) 
5 NTRACE 1 — Implemented 
0 — Not implemented 


N-Trace Port Enable (read/write) 
1 —> Enable 
0 — Disable 
Defaulted to 0 at the debug module initialization. 


6 NTRACEN 


20.5.2.6 N-Wire Debug Module Control register (DM_CONTROL) 


The DM_CONTROL register contains enabling and status fields for debug reset, 
processor breaking, interrupt and exception handling, single-stepping, and 
execution of N-Wire Monitor instructions. Figure 20-20 shows the register 
format. Table 20-13 describes the register fields. Certain fields of this register are 
copied into the DRCNTL debug register, as described in Section 20.4.2. 


21 0 
See table below for field descriptions 
22 


Figure 20-20 N-Wire Debug Module Control (DM_CONTROL) 
Register Format 
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Table 20-13 N-Wire Debug Module Control (DM_CONTROL) Register Fields 


Bits 


Name 


Description 


RESET 


Debug Reset (read/write) 

1 > Requests debug reset 

0 — Releases debug reset 
Defaulted according to the level of the RMode* input at 
processor ColdReset*. When RMode* is active low, this register 
bit is active high. 


BREAK 


Break Request (read/write) 
Write 1 > Requests break 
Write 0 + No operation 
Read 1 — Command not completed (still requesting break) 
Read 0 —> Break is completed 
This bit is cleared when the break is completed. 
Defaulted to 0 at the debug module initialization. 


MRST 


Mask Reset* (read/write) 

1 > Ignores (masks) Reset* input while in Debug mode 

0 — Accepts Reset* input while in Debug mode 
Defaulted to 1 at the debug module initialization. ColdReset* is 
not masked by this bit. 


MNMI 


Mask NMI* (read/write) 

1 — Suppress the occurrence of NMI* 

0 — Do not suppress the occurrence of NMI* 
Defaulted to 0 at the debug module initialization. 


MINT 


Mask Interrupts (read/write) 

1 > Ignores user interrupt input 

0 — Accepts user interrupt input 
This mask affects interrupts via the Int* signals or via an external 
write. Software interrupts are not masked. Defaulted to 0 at the 
debug module initialization. 


STEP 


Single-Step (read/write) 
This bit allows the user to execute one Normal mode instruction 
followed by a break. Single-step occurs after a DRET 
instruction. The processor returns to Normal mode, executes a 
single instruction, and breaks back into Debug mode. Enabling a 
single-step break while the processor is in Normal mode results 
in undefined behavior. 

1 > Enable single-step break (single-step mode) 

0 — Disable single-step break 
Defaulted to 0 at the debug module initialization. 
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Table 20-13 N-Wire Debug Module Control (DM_CONTROL) Register Fields (continued) 


Bits Name Description 


Break Cause (read only) 

This field consists of multiple bits. One bit is assigned for each 
break cause and the corresponding bit is set when the break 
occurred. Multiple bits are set if the break occurred by multiple 
break causes. Break Cause is cleared by DRET or by processor 
reset. 

The bit assignments are defined as follows: 

13:6 BRKCAUSE Bit 6 — External break 

Bit 7 — Single-step 

Bit 8 — Software breakpoint 

Bit 9 > Reserved 

Bit 10 > Reserved 

Bit 11 —> Instruction address breakpoint 

Bit 12 — Data access (address or data) breakpoint 

Bit 13 > Reserved 


Debug or Normal Mode (read only) 
14 DBM 1 > Debug mode is active 
0 — Normal mode is active 


CPU Status (read only) 
The processor status is encoded as follows: 
000 — Reset (highest) 
00 1 > Reserved 
010 —- Reserved 
01 1—-— Reserved 
100 —- Reserved 
10 1 — Reserved 
110—- Reserved 
1 1 1 — Normal mode (lowest) 


17:15 CPUSTAT 


Active Flag for Processor Clock (read/write) 
This bit indicates clock activity. The debug tool can use this bit 
to detect whether a clock is supplied into the processor from the 
target system board. 

Write 1 — No operation 

Write 0 — Clear active flag 

Read 1 — Clock is active 

Read 0 — Clock has not been active since flag was cleared 
This flag is set when there's any activity on the clock. This flag 
is cleared when the debug tool writes 0 into this bit. 


18 ACTFLG_CLK 
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Table 20-13 N-Wire Debug Module Control (DM_CONTROL) Register Fields (continued) 


Bits 


Name 


Description 


19 


ACTFLG_BUS 


Active Flag for Bus (read/write) 
This bit indicates bus activity. 

Write 1 — No operation 

Write 0 — Clear active flag 

Read 1 — Bus cycle is active 

Read 0 > Bus cycle has not occurred since flag was cleared 
This flag is set when there is any activity on the bus. It is cleared 
when the debug tool writes 0 into this bit. 


20 


MON_INSTEXEC 


Monitor Instruction Execution (read/write) 
Setting this bit causes the processor to fetch and execute the 
instruction in the MON_INST register. 
Write 1 — Fetches and executes MON_INST instruction 
Write 0 + No operation 
Read 1 —> A monitor instruction is executing 
Read 0 — No monitor instruction is executing 


21 


DM_EXCEPT 


Debug Mode Exception (read/write) 
Read 1 — Instruction executed in Debug mode has caused 
an exception 
Read 0 — No exception in Debug mode since flag was 
cleared 
Write 1 — No operation 
Write 0 — Clear exception flag 
If any instruction other than a Load or Store causes an exception, 
the results and processor state are undefined. 
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20.5:277 N-Wire Monitor Instruction register (MON_INST) 


All JTAG accesses to system resources, such as the processor’s Normal mode and 
Debug mode registers, cache, external memory, and I/O are accessed via a 
monitor mechanism. The MON_INST and MON_DATA registers are used to 
insert instructions and data, respectively, into the processor. 


When the debug module is active (DINIT bit cleared to 0 in the DM_SYSTEM 
register) and a debug break occurs, processor instructions can be loaded and 
executed. The MON_INST instruction causes a processor instruction to be 
scanned into the write-only MON_INST register through the JTAG port. The 
MON_INSTEXEC bit in the DM_CONTROL register can then be set to cause the 
processor to execute the instruction. 


When executing Monitor instructions, the processor PC does not give meaning to 
the instruction. Therefore, all processor instructions and events that redirect the 
PC are not defined and produce unpredictable behavior. The DRET instruction is 
the only instruction that can be used for redirecting the PC. 


Figure 20-21 shows the register format. A monitor instruction can only be 
executed while the processor is in Debug mode and the debug module is not reset. 
If the MON_INSTEXEC bit is written to while in Normal mode, the results are 
undefined. Attempts to modify the MON_INST or MON_DATA registers while 
executing a Monitor instruction will result in undefined behavior. 


An example of loading processor instructions and data with the Monitor 
instruction is given in Section 20.5.3. 


63 0 


MON_INST 


Figure 20-21 N-Wire Monitor Instruction (MON_INST) Register Format 
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20.5.2.8 N-Wire Monitor Data register (MON_DATA) 


The MON_DATA register is identical to the DDATAO debug register (DR2), 
described in Section 20.4.2.4. The MON_DATA instruction and register are used 
in conjunction with the MON_INST instruction and register to insert data and 
instructions into the processor. The MON_DATA instruction causes data to be 
scanned into the MON_DATA (DR2) register through the JTAG port. The 
MON_INST instruction is then used to scan the MFDR instruction into the 
MON_INST register. The MON_INSTEXEC bit in the DM_CONTROL register 
can then be set to cause the instruction currently loaded in the MON_INST 
register (i.e., the MFDR instruction) to move this data into a general-purpose 
register. 


Figure 20-22 shows the register format. An example of loading processor 
instructions and data with the Monitor instruction is given in Section 20.5.3. 


63 0 


MON_DATA | 


Figure 20-22 N-Wire Monitor Data (MON_DATA) Register Format 
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20.5.2.9 N-Trace System register (TRCSYS) 


The TRCSYS register is used to control N-Trace reset and to give read-only 
information that indicates the processor’s N-Trace implementation parameters. 
Figure 20-23 shows the register format. Table 20-14 describes the register fields. 


10 8 7 6 5 4 3 2 0 
NDATAPIN |reserved] reserved] RESET| CE MODE 
3 1 1 1 2 3 


Figure 20-23 N-Trace System (TRCSYS) Register Format 


Table 20-14 N-Trace System (TRCSYS) Register Fields 


Bits Name Description 


Trace Mode (read only) 

The read value is 2H. Bits 2:1 are 01, indicating Target PC 
2:0 MODE (TPC) packet tracing in the N-Trace Level | mode (TPC 
packets at exceptions and indirect jumps). Bit 0 is 0, indicating 
a non-real-time trace. 


Trace Clock (TreClk) Divisor (read only) 
4:3 CLKDIV These bits are set by the DivMode (1:0) pins. The trace port 
runs at the system interface clock frequency. 


Reset N-Trace (read/write) 
1 + N-Trace is in reset. The NOP packet is output to the 
TrcData (3:0) port. The bit is initialized to | at 
ColdReset*. 
0 > N-Trace is active. Trace information is output to the 
TrcData (3:0) port. 
If N-Trace is reset, no trace packets are generated (NOP 
packets are on the internal N-Trace port). When reset is 
released, the value of the current PC is output and trace 
information proceeds. 


7:6 Reserved The read value is OH. 


N-Trace Data Pins (read only) 
10:8 NDATAPIN The read value is 4H, indicating 4 pins in the TrcData (3:0) 
port. 


fe) RESET 
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20.5.2.10 N-Wire and N-Trace Hardware Breakpoint registers 


Debug registers DR4—DR15 serve as the hardware breakpoint registers for both 
internal-access and external-access Debug mode. The registers are described in 
Section 20.4.2. 


20.5.3 N-Wire Monitor Data Download Example 


The following example describes the steps for downloading data into external 
memory using the N-Wire Monitor instruction and data resources. To do this, use 
the following sequence: 


1. 


2 
3. 
4 


Break into Debug mode with the debug module enabled (DINIT bit cleared). 
Scan the download data into the MON_DATA register via JTAG. 
Scan the MFDR instruction into the MON_INST register via JTAG. 


Set the MON_INSTEXEC bit in the DM_CONTROL register via JTAG. This 
causes the processor to execute the instruction in the MON_INST register, 
thus moving the data from the MON_DATA debug register (same as debug 
register DR2) into a general-purpose register in preparation for a store 
operation. 


Check for completion by checking the MON_INSTEXEC bit via JTAG. 
Scan a Store instruction into the MON_INST register via JTAG. 


Set the MON_INSTEXEC bit via JTAG. This causes the processor to execute 
the Store instruction in the MON_INST register, thus storing the data from 
the general-purpose register into memory. 


Check for completion by checking the MON_INSTEXEC bit via JTAG. 


Repeat steps 2—8 for each doubleword of data to be stored in memory. 


. Scan a DRET instruction into the MON_INST register via JTAG. 
. Set the MON_INSTEXEC bit via JTAG to execute the DRET. 
. Check for completion by checking the MON_INSTEXEC bit viaJTAG. When 


the DRET is complete, the processor has returned to Normal mode. 
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20.5.4 
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N-Trace Packets 


The processor can trace its internal instruction execution by using the N-Trace 
protocol. It uses the TrcData (3:0), TrcEnd, and TrcClk signals on the JTAG port 
(Section 20.5.1) to send N-Trace packets to an external debug tool. The processor 
supports the N-Trace packets shown in Table 20-15. All packets maintain a 4-bit 
code definition and output information that is a multiple of four bits. 


Table 20-15 N-Trace Packet Types 


Mnemonic Code Description 
EXP 0,1,1,0,<exp_id> Exception 
LSEQ 0,1,1,1 Long Sequential Execution 
NOP 0,0,0,0 No Operation 
NSEQ 1,0,0,0,<seq #> Non-Sequential Operation 
TPC 0,1,0,0,<program_counter> Target PC 


The processor generates useful trace packets only in Normal mode (User, 
Supervisor, or Kernel mode). It does not generate trace packets (other than NOP) 
in Debug mode; instead, NOP packets are continuously output on the N-Trace 


interface. When the processor is at the instruction boundary before entering Debug 
mode, all packets that have been generated are output. The processor also finishes 
all pending system interface operations before entering Debug mode. When the 
processor leaves Debug mode, it generates a Target PC (TPC) trace packet to 
indicate the instruction address where the normal execution resumes. 


The processor supports only one N-Trace mode (N-Trace Level 1 mode, with TPC 
packets at exceptions and indirect jumps). The mode is non-real time, which 
implies that the CPU pipeline stalls if the trace buffer fills. N-Trace is either on or 
off and does not have additional control options. 
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Exception (EXP) 
¢ Mnemonic EXP <exp_id> 
¢ Code 0,1,1,0,<exp_id> 


The EXP packet is output when an exception occurs within the processor. The 
<exp_id> field contains the Exception vector address taken. Bit 3 is equal to the 
BEV bit of the Status register. Bits 2:0 are an ID indicating the exception type. 


ID Exception 
000 NMI 
001 Debug Break 
010 Reserved 
O11 Reserved 
100 TLB Refill 
101 XTLB Refill 
110 Cache Error 
111 Others 


Long Sequential Execution (LSEQ) 
¢ Mnemonic LSEQ 
* Code 0,1,1,1 


The LSEQ packet indicates that 256 instructions have been executed sequentially. 
This is the limit of the sequential instruction counter. 


No Operation (NOP) 
¢ Mnemonic NOP 
* Code 0,0,0,0 


The NOP packet is output if there are no other packets while trace is enabled. It is 
also output if trace is disabled. 
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20.5:4.5 


20.5.4.6 
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Non-Sequential Operation (NSEQ) 


¢ Mnemonic NSEQ <seq #> 

¢ Code 1,0,0,0, <seq #> 
The NSEQ packet indicates the current value of the 8-bit Trace instruction counter 
(IC). It is output when a branch, jump, or exception occurs. The <seq #> field is 


the count of instructions since the last NSEQ or LSEQ occurred; the count starts 
at 0. 


Target PC (TPC) 
¢ Mnemonic TPC <program_counter> 
¢ Code 0,1,0,0,<program_counter> 
The TPC packet contains a 40-bit value representing the virtual address of: 


¢ The target address of a Jump Register instruction after an NSEQ 
packet 


¢ The new PC after an ERET instructi 
e The starting trace location when trace reset is released 


¢ The new PC when the processor leaves Debug mode whil N-Trace is 
enabled 


N-Trace instruction summary 


Table 20-16 summarizes the Trace instructions and the trace behavior that they 
create. The instructions are grouped according to the classifications that are 
defined as part of the N-Trace architecture. The instruction counter (IC) is a 
pointer that indicates the count of instructions after an NSEQ or TPC packet. This 
count starts at 0. The IC reported by a trace action is the IC of the instruction that 
caused the action. 
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Table 20-16 N-Trace Instruction Summary 


Instruction Set 


Instruction or Group 


Trace Action! 


CPU Instruction Set 


J, JAL 
(Action occurs in the delay slot) 


NSEQ <IC>; IC = 0; 


CPU Instruction Set 


JR, JALR 
(Action occurs in the delay slot) 


NSEQ <IC>; IC = 0; TPC; 


CPU Instruction Set 


CPU Instruction Set 


PC-Relative Conditional Branches 
(Action occurs in the delay slot for 
Branch Taken case) 


Exceptions and SYSCALL, BREAK 
instructions 


If (Branch Taken) 

NSEQ <IC>; IC = 0; 

Else (Not Taken) 

IC <- IC + 1; If dC = 256) 
LSEQ; IC = 0; 


NSEQ<IC>; EXP<cause>; IC = 
0; 


CPU Instruction Set 


Conditional Traps 


If (Trap Taken) 

NSEQ<IC>; EXP<cause>; IC = 
0; 

Else (Not Taken) 

IC <- IC +; If dC = 256) LSEQ; 
IC =0; 


CPU Instruction Set 


All other instructions 


IC <- IC + 1; If UC = 256) 


FPU Instruction Set 


LSEQ; IC = 0; 
CPO Instruction Set ERET NSEQ <IC>; IC = 0; TPC; 
. : . IC <-IC +1; 
CPO Instruction Set All other instructions If (IC = 256) LSEQ: IC =0; 
If (Branch Taken) 


Conditional Branches 
(Action occurs in the delay slot for 
Branch Taken case) 


NSEQ <IC>; IC = 0; 
Else (Not Taken) 
IC <- IC + 1; If dC = 256) 


LSEQ; IC = 0; 
F : : IC <- IC + 1; If (C = 256) 
FPU Instruction Set All other instructions LSEQ: IC = 0: 
Debug Instructions Debug Break or Break instruction NSEQ <IC>; IC = 0; 
Debug Instructions DRET IC = 0; TPC; 
Debug Instructions All other instructions No action 


Note: 


1. IC = instruction counter, a pointer indicating the number of instructions after an NSEQ or TPC packet. 
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Table 20-17 shows an example of a Break instruction with an exception handler 


instruction indicated as the target. 


Table 20-17 Trace Example #1 


IC Instruction Trace Packet(s) 
Break NSEQ<N>; EXP<cause> 
0 Target 
Target + 1 


For taken branches and Jump instructions, the PC is not redirected until the delay 
slot is executed. The NSEQ and IC reported is for the delay slot of the Branch 


instruction. Table 20-18 shows an example of a Branch instruction with a Target 
instruction target: 


Table 20-18 Trace Example #2 


IC Instruction Trace Packet(s) 
N-1 Branch 
Delay Slot NSEQ<N> 
0 Target 
1 Target + 1 


Table 20-19 shows an example of a Jump Register instruction with a Target 
instruction and target address. 


Table 20-19 Trace Example #3 


IC Instruction Trace Packet(s) 
N-1 Jump Register 
Delay Slot NSEQ<N>; TPC<TargetAddress> 
0 Target 
Target + 1 


784 


Vr5432 Microprocessor User’s Manual 


Debug and Test Features 


For Branch instructions not taken, the delay slot is always part of the instruction 
flow. The NSEQ and IC reported include delay slots of all Branch instructions. 
delay slot is included even for branch-likely cases where the architecture does not 
include it. For branch-likely cases, the delay slot is treated as an NOP. 


Vr5432 Microprocessor User’s Manual 785 


Chapter 20 


786 Vr5432 Microprocessor User’s Manual 


Subblock Data Retrieval Order 


A 


Data block elements (bytes, halfwords, words, or doublewords) can be retrieved 
from storage in either sequential or subblock order. This appendix describes these 
retrieval methods, with an emphasis on subblock retrieval order. 
Note: The Vr5432 processor requires external memory systems to 
retrieve data in subblock order. 


Sequential retrieval fetches data block elements in serial, or sequential, order. 
Figure A-1 shows an example of sequential retrieval, in which word 0 is taken first 
and word 3 is taken last. 


W 1 W2 W3 


FPL 


Word 0 Word 3 
retrieved first retrieved fourth 
Word 1 Word 2 
retrieved second retrieved third 


Figure A-I Retrieving a Data Block in Sequential Order 
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Subblock retrieval allows the system to define the retrieval order. Figure A-2 
shows retrieval of a four-word block; the critical word at the target address is 
retrieved first (W2), followed by the remaining words. (The smallest data element 
of a block transfer is a doubleword.) 


Retrieval Order 2 3 0 1 


i \ 


Wo W3 
retrieved third retrieved second 


wi Wwe2 
retrieved fourth retrieved first 


Figure A-2  Subblock Order Data Retrieval 


The subblock ordering logic generates an address for each word as it is retrieved 
by executing a bitwise exclusive-OR (XOR) of the starting block address with the 
output of a binary counter that increments with each word, starting at word zero 
(005). Using this scheme, Table A-1 through Table A-3 list subblock word 
retrieval for a four-word block, based on three different starting-block addresses: 
10, 115, and 015. The subblock order is generated by an XOR of the subblock 
address (105, 115, and 01>) with the binary count of the word (00, through 11>). 


Table A-1 Subblock Sequence: Address 10> 


Starting Block 
Address 


Word 
Retrieved 


Cycle Binary Count 
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Subblock Data Retrieval Order 
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Cycle as oe Binary Count Bs d 
1 11 00 11 
2 11 01 10 
3 11 10 01 
4 11 11 00 
Table A-3 Subblock Sequence: Address 014 
Cycle ee — Binary Count ee d 
1 01 00 01 
2 01 Ol 00 
3 
4 
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Comparing the VR4300, VR5000, and VR5432 


Processors 


Table B-1 compares the Vr4300, VR5000, and Vr5432 processor features. 


Table B-1_ Vr4300, Vr5000, and Vr5432 Feature Compariso 


Uncached 


Uncached 


Feature VR4300 VR5000 VR5432 
: Cached (write-back) 
; Cached (write-back) : 
Cache Algorithms Cached (write-back) Cached (write-through) Cached (write-through) 


Uncached 
Accelerated uncached 


Circuit Design 


Parity Support 
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Technique Dynamic Static Static 

Coprocessor 0 Hazards | Yes Yes No 

Dae 8 KB 32 KB 32 KB 

Array Size 

Daca Direct mapped 2-way set associative | 2-way set associative 
Associativity PP y y 

Data Cache No No Yes 

Line Locking (Lock bit/cache line) 
Data Cache 

Line Size 16 bytes 32 bytes 32 bytes 

Data Cache No Yes No 
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Table B-1_ Vr4300, Vr5000, and Vr5432 Feature Comparison (continued) 


Array Size 


Feature VR4300 VR5000 VR5432 
JTAG Boundary Scan 
Hardware Debug JTAG Boundary Scan |No N-Wire debug support 
Features Hardware breakpoints 
Instruction jamming 
Instruction Cache 16 KB 32 KB 32 KB 


Instruction Cache 


Direct-mapped 


2-way set associative 


2-way set associative 


Size 


Associativity 
Instruction Cache No No Yes 
Line Locking (Lock bit/cache line) 
Instruction Cache 
tine Sine 32 bytes 32 bytes 32 bytes 
Instruction Cache No Yes No 
Parity Support 
Instruction Fetch No No 4096 entries 
Branch Prediction 2-bit saturating counter 
MIPS IV + Rotate 
Instruction Set + DSP 
Architecture MES MIPSy (Integer MAC, etc.) 
+ Media 
Nonblocking 
Load/Store : : hits under misses 
Architecture Blockins Blocks Up to 4 outstanding D- 
cache misses 
Performance Counters Two 32-bit counters 
(Software/Code No No Selectable any 2 of 16 
Tuning) different events 
Physical Address 32 bits 36 bits 36 bits internal; 32 bits 


external 


Power-On 
Configuration Modes 


Dedicated pins 


Scan-in boot ROM 


Dedicated pins 


Secondary Cache 
Support 
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No 


Yes 


No 
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Processors 
Table B-1_ Vr4300, Vr5000, and Vr5432 Feature Comparison (continued) 
Feature VR4300 VR5000 VR5432 
Symmetrical 2-way 
fac (2 Integer 
Limited 2-way : ‘ 
Superscalar : + 2 Floating Point 
; } Scalar (Single Issue) | (1 Integer 
(Execution Units) #1 Floatin® Point) + 1 Load/Store 
F +1MAC 
+ 1 Media) 
System Interface 
Clock Divisors 1, 1.5, 2,3 2, 3,4,5, 6, 7,8 2, 2.5, 3,4 
System Interface No Yes No 
Parity Support 
R4000-like R4000 + tee 
System Interface (Remover Unused Additional Write Split Transactions, or 
Protocol ; R4000-like (in VR4300 
Encodings) Modes 


Emulation mode) 


System Interface 


32 bits address/data 


64 bits + parity 


32 bits + parity 


TLB 


Width multiplexed address/data address/data 
multiplexed multiplexed 
4 entries 
TLB 2 entries : 
Data Micro-TLB Ne (4 KB fixed page size) | KB-16 MB variable 
page sizes) 
ae 2 entries 2 entries 4 entries 
Instruction Micro- (4 KB fixed page size) |(4 KB fixed page size) (4 KB — 16 MB variable 


page sizes) 


32 double entries 


48 double entries 


48 double entries 


Size (largest segment) 


40 bits 
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Lew (4 KB-16 MB variable | (4 KB-16 MB variable | (4 KB—16 MB variable 
Joint (2nd Level) : . : 

page sizes) page sizes) page sizes) 
Virtual Address 


40 bits 


40 bits 
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C 


For noisy module environments, a phase-locked loop (PLL) filter circuit, as 
shown in Figure C-1, is recommended. In addition, the configuration shown in 
Figure C-2 is required for PLLCap input. 


10 pH 
MOE sane VecP 
10uF— 0.4unF— 100 pF — 
Vss VssP 


Figure C-1 PLL Filter Circuit 


PLLCap 


cae = 


© 
1 


VssP 


Figure C-2. PLLCap Circuit 
R1 = 1 KOhm, Cl = 400 pF, and C2 = 40 pF. All values shown are nominal. 


Minimum and maximum values are TBD. All components should be placed as 
closely as possible to the indicated pins. 
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Instruction Hazards 


This chapter identifies R5432 instruction hazards that occur with certain 
instruction and event combinations (such as pipeline delays, cache misses, 
interrupts, and exceptions). These hazards can cause unpredictable system 
behavior and malfunctions. 


Most hazards result from instructions modifying and reading state in different 
pipeline stages. Such hazards are defined between instruction pairs, not ona single 
instruction in isolation. Other hazards are associated with instruction restartability 
in the presence of exceptions. 
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For the following code hazards, the behavior is undefined and unpredictable. 
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Any instruction that would modify the PageMask, EntryHi, EntryLo0, 
EntryLol, or Random CPO registers should not be followed by 
TLBWR instruction. There should be at least two integer instruction 
between the register modification and the TLBWR instruction 


Any instruction that would modify the PageMask, EntryHi, EntryLo0, 
EntryLol1, or Index CPO registers should not be followed by 
TLBWI instruction. There should be at least two integer instructions 
between the register modification and the TLBWI instruction. 


Any instruction that would modify the Index CPO register or the 
contents of the JTLB should not be followed by a TLBR instruction. 
There should be at least two integer instructions between the register 
modification and the TLBR instruction 


Any instruction that would modify the PageMask or EntryHi CPO 

registers or the contents of the JTLB should not be followed by a 

TLBP instruction. There should be at least two integer instructions 
between the register modification and the TLBP instruction. 


Any instruction that would modify the EPC, ErrorEPC, or Status CPO 
registers should not be followed by an ERET instruction. There 
should be at least two integer instructions between the register 
modification and the ERET instruction. 


A Branch or Jump instruction is not allowed in the delay slot o 
another Branch/Jump instruction. This sequence is illegal in th 
MIPS architecture. 


The two instructions preceding a DIV, DIVU, DDIV, DDIVU, MULT, 
MULTU, DMULT, or DMULTU instruction should not read the HI 0 
LO registers. There should be at least two integer instruction 
between the register read and the register modification 
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address translation ... 81 
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TLB entry format ... 64 
64-bit 
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address translation ... 81 
FPU operations ... 150 
TLB entry format ... 64 


Vr5432 Microprocessor User’s Manual 


A 


address cycles ... 206 
Address Error exception ... 123 
address space identifier (ASID) ... 46 
address spaces 
64-bit translation of ... 50 
address space identifier (ASID) ... 46 
physical ... 47 
virtual ... 46 
virtual-to-physical translation of ... 47 
addresses ... 45 
addressing 
and data formats ... 10 
big-endian ... 10 
Kernel mode ... 56 
little-endian ... 10 
Supervisor mode ... 53 
User mode ... 51 
virtual address translation ... 81 
See also address spaces 
array, page table entry (PTE) ... 94 
ASID. See address space identifier 


B 


Bad Virtual Address register (BadV Addr) ... 


big-endian, byte addressing ... 10, 166 
binary fixed-point format ... 161 
bit definition of 

ERL ... 51, 53, 56, 101 

EXL ... 51, 53, 56, 101, 104, 113 


IE... 101 

KSU ... 51, 53, 56 
KX... 56, 101 

SX ... 53, 101 
UX... 51, 101 


branch delay ... 34 
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Branch instructions, CPU ... 9 
Branch instructions, FPU ... 167 
Break or Trigger I/O (BkTgIO*) Signal ... 762 
Breakpoint exception ... 131 
Bus Error exception ... 128 
byte addressing 
big-endian ... 10, 166 
little-endian ... 10, 166 


C 
Cache Error (CacheErr) register ... 110 


Cache Error exception ... 127 
Cache Error exception process ... 114 
Cause register ... 102 
central processing unit (CPU) 
data formats and addressing ... 10 
exception processing ... 91 
See also exception processing, CPU 
instruction formats ... 8 
instruction set 
overview ... 8 
instructions. See instructions, CPU 
interrupts ... 143 
See also interrupts, CPU 
memory management 
See also memory management 
operating modes ... 17 
registers 
See also registers, CPU 
System Control Coprocessor (CPO) ... 63 
transfers between FPU and CPU ... 165 


ckseg0 ... 62 
ckseg1 ... 62 
ckseg3 ... 62 
cksseg ... 62 


Clock interface ... 313 
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signals ... 24 


cold reset .. 


. 307 


Compare instructions, FPU ... 167 


Compare register ... 96 


compatibility 
DEC VAX... 10 
iAPX x86 ... 10 
IBM 370 ... 10 
MC68000 ... 10 


computational instructions, CPU ... 9 
64-bit operations ... 327 
formats ... 326 


computational instructions, FPU 


floating-point ... 167 


Config register ... 77 


Context register ... 94 
Control/Status register, FPU ... 153, 155 


conversion instructions, FPU ... 166 


coprocessor instructions ... 9 


Coprocessor Unusable exception ... 133 


Count register ... 95 


CPO. See System Control Coprocessor 


csseg ... 56 
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Data Access Breakpoint ... 754 
Data Breakpoint Address Mask register ... 758 
Data Breakpoint Address register ... 757 


Data Breakpoint Control/Status register ... 755 


Data Breakpoint Data Mask register ... 759 


Data Breakpoint Data register ... 758 


data cycles 


... 206 


data formats 


and addressing ... 10 


byte ordering ... 10 
data identifiers ... 243 
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DBA ... 757 

DBAM ... 758 
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